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Predicting Earnings Using a Model Based 
on Cost Variability and Cost Stickiness 
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ABSTRACT: We evaluate the descriptive validity of the cost behavior model for profit 
analysis using Compustat data. For this purpose, we propose ап earnings forecast 

5 model decomposing earnings into components that reflect (1) variability of costs with 
sales revenue and (2) stickiness in costs with sales declines. We evaluate the predictive 
ability of our mode! by benchmarking its performance in forecasting one-year-ahead 
retums on equity against that of twó other time-series models based on line item In- 
formation reported in the Income statement and in the statement of cash flows. Spe- 

. cifically, we consider a model that disaggregates earnings into operating income and 
non-operating Income components and another that disaggregates earings into cash 
flows and accruals components. While all three models are less accurate than analysts’ . 
consensus forecasts that rely on a larger informatlon set, we find that our model pro- 
vides substantial Improvement in forecast accuracy over the other two models that use 
only the line items In the financial statements. Finally, invoking the market efficiency 
assumption, we find that earnings forecast errors based on our model have greater 
relative information content than forecast errors based on the two alternative models 
based on financial statement information in explaining abnormal stock returns. 


Keywords: analyst forecasts; cost variability; eamings forecasts; fixed costs; market 
association; predictive content; sticky costs; variable costs. 
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1 Data Availability: Data used in this study are available from public sources Identified 

a : in the paper. 

3 

B L INTRODUCTION 

| nderstanding cost behavior is one of the most important aspects of profit analysis 
г | | for managers. Central:to the cost-volume-profit analysis discussed in most mana- 
З gerial accounting textbooks is the traditional model of fixed and variable costs 


j We gratefully acknowledge helpful comments and suggestions from participants in research seminars at University 

Y of , University of California, San Diego, Michigan State University, Rice University, Temple University, 
and The University of Texas at Dallas, from the discussants of our paper and other participants at the 2004 
Management Accounting Section and the 2004 Financial Accounting and Reporting Section Midyear Meetings, 
and from two anonymous reviewers. We also thank Thomson Financial I/B/E/S for providing the data on analysts’ 
earnings forecasts. : 


Editor's note: This paper was accepted by Lawrence Brown, Editor. 
; Submitted July 2003 
| Accepted August 2005 
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(Garrison and Noreen 2002). If this model is descriptively valid, then its estimation using 
past data should provide a basis for forecasting future earnings. Despite the conceptual 
importance of incorporating cost behavior in conducting profitability analysis, few empirical 
studies have systematically examined the forecasting ability of models that explicitly rec- 
ognize the relation between costs and sales when forecasting future profits. Our purpose is 
to evaluate the performance of such a cost behavior model in forecasting earnings by 
benchmarking the predictive performance of our model against that of alternative time- 
series models using earnings component classifications reported in the income statement 
and in the statement of cash flows.! 

Prior research addresses the question of forecasting earnings from an externa: reporting 
perspective; that is, earnings forecast models are based on line items reported in the income 
statement (Lipe 1986; Fairfield et al. 1996), on accounting signals calculated from financial 
statements (Ou and Penman 1989), or on components of earnings such as cash flows and 
accruals (Sloan 1996). We propose an approach grounded in the management accounting 
tradition that models earnings as consisting of components proportional to sales increases 
and decreases, and another unrelated to sales changes. This method of earnings decompo-. 
sition is motivated by a cost behavior model that recognizes sales as the key driver of profit 
and variable costs as varying with sales. In addition, we incorporate in our model evidence 
that cost changes exhibit stickiness in periods of sales decline (Noreen and Soderstrom 
1997; Anderson et al. 2003). Cost stickiness refers to the phenomenon that costs decrease 
less with a sales decrease than they increase with an equivalent sales increase. Economic 
considerations such as adjustment costs of reducing capacity and ramping up capacity in 
case sales rebound, motivate managers not to cut back on activity resources in response to 
a decline in sales to the same extent that they increase resources in response to єп increase 
in sales (Anderson et al. 2003). This asymmetric cost behavior implies asymmetric behavior 
of earnings with respect to sales changes. Our cost behavior based approach to forecasting 
earnings is consistent with recent calls to incorporate economic considerations in models 
characterizing earnings components (Kothari 2001, 151). 

Our model based on cost variability and cost stickiness (CVCS) is informationally 
parsimonious, relying only on earnings and sales time-series data. To evaluate the predictive 
ability of our CVCS model based on information drawn from the financial statements, we 
benchmark its performance against four other forecast mode!s, including two that rely on 
information on past earnings components to predict future returns on equity (КОБЕ)? The 
first model (Fairfield et al. 1996) uses income statement line items to classify earnings into 
operating income and non-operating income components to generate forecasts of future 
ROE (hereafter, the OPINC model). The second model (Sloan 1996) predicts future ROE 
based on past cash flows and accruals components of earnings (hereafter, the CASHFLOW 
model). The interest in earnings decomposition in forecasting stems largely from the belief 
that earnings components have different time-series properties and incorporating such in- 
formation through earnings disaggregation provides a more accurate forecast of future earn- 
ings (Fairfield et al. 1996; Kothari 2001). To provide a baseline benchmark, we also consider 
a simple forecast model based only on past ROE (hereafter, ће ROE model). Prior research 


! We describe our proposed model and other benchmark models as “time-series” models because they are based 
on a time-series correlation between current and past realizations of components of earnings. However, the 
empirical estimation of these models may be done on a cross-sectional or pooled basis, rather than on separate 
time-series data of individual firms to improve precision of the estimated model coefficients. 

2 We focus on forecasting ROE to be consistent with prior literature (Fairfield et al. 1996; Frankel and Lee 1998). 
However, results not reported in the paper indicate that the documented superior forecasting performance of our 
CVCS model relative to other time-series models we consider is robust tc forecasting return on assets (ROA). 
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- provides evidence that analysts’ earnings forecasts are more accurate than time-series mod- 

els due to their information advantage in collecting, processing, and aggregating multiple 
sources of financial and nonfinancial information for forecasting earnings (Brown et al. 
1987a; O’Brien 1988). Therefore, we also evaluate the performance of models based on 
earnings disaggregation relative to that of analysts’ consensus forecast to gauge the extent 
to which different methods of earnings decomposition succeed in capturing some of the 
analysts’ information advantage. 

Our primary objective is to evaluate whether models that explicitly incorporate the 
relationship between cost changes and sales changes are better in capturing the behavior 
of the earnings time-series than do models that ignore such a relationship between income 
statement items.? Using a sample of 8,771 firms in the period 1992—2002, we provide 
evidence that our proposed CVCS model substantially improves forecast accuracy of one- 
year-ahead ROE over the ROE, OPINC, and CASHFLOW models. Specifically, we find 
that the reduction in median absolute forecast errors using our CVCS model relative to the 
ROE model is about nine times as large as the reduction in median absolute forecast errors 
-using the OPINC or the CASHFLOW model relative to the ROE model. We also find that 
recognizing cost variability and cost stickiness in earnings decomposition reduces over 30 
percent of the advantage in median absolute forecast errors that financial analysts’ forecasts 
display relative to the ROE model.* 

Invoking the market efficiency hypothesis, we examine whether our CVCS model has 
greater relative information content in capturing the market’s earnings expectations than the 
other three time-series models. Relative information content is assessed based on the relative 
strength of the association between forecast errors in ROE from each of these models and 
the contemporaneous abnormal stock returns. Using a sample of 4,348 firms in the period 
1992—2002, we find that our CVCS model also outperforms the other three models in terms 
of relative information content. The evidence suggests the information in earnings com- 
ponents that pertains to cost variability and cost stickiness is correlated with a part of the 
information set used in forming the capital market’s earnings expectations. 

We organize the remainder of the paper as follows. In the next section we discuss 
earnings forecasting using earnings components and present our contrasting method of 
earnings decomposition based on cost variability and cost stickiness. We describe the sam- 
ple and forecast models in Section III, and we report results of forecast accuracy compar- 
isons in Section IV. In Section V, we present tests of relative information content of our 
model in evaluating the association between earnings forecast errors and abnormal stock 
returns. Finally, in Section VI we offer concluding remarks and suggest directions for future 
research. 


Ц. EARNINGS DECOMPOSITION AND FORECASTING 
Earnings Forecasting Using Earnings Components 
Our study is related to a branch of the earnings forecasting literature concerned with 
the use of earnings components in predicting future earnings (Brown 1993). Fairfield et al. 
(1996) and Kothari (2001) suggest that partitioning earnings into components with different 


3 We restrict our attention to parsimonious models based only on income statement and cash flow statement data, 
and do not attempt to develop forecast models (analogous to analysts’ forecast models) relying on a wide array 
of information beyond just financial statements. However, the model structure we develop to link sales changes 
to cost changes, and consequently to earnings changes, may be augmented to incorporate economy-, industry-, 
and firm-specific information affecting sales or cost time-series. 

* The corresponding reduction in median absolute forecast errors is less than 6 percent for the OPINC and 
CASHFLOW models. 


The Accounting Review, March 2006 


288 Banker and Chen 


levels of persistence may enable superior modeling of the earnings process and improve 
the accuracy of earnings forecasts. Two time-series forecast models using past earnings 
components to predict future earnings have attracted much attention in the prior literature. 
One such model decomposes current earnings into cash flows and accruals components and 
predicts future earnings using them. Sloan (1996) provides evidence that cash flows have 
higher persistence than accruals for assessing one-year-ahead earnings but he does not 
examine the out-of-sample forecasting performance of this model because his focus is on 
whether the market fully incorporates the information about the differential persistence of 
cash flows and accruals components. The other earnings-decomposition-based forecast 
model uses line items in the income statement as the basis for predicting future earnings. 
Fairfield et al. (1996) find that decomposing earnings into operating income and non- 
operating income improves the accuracy of one-year-ahead forecasts of returns on equity, 
relative to a basic model using only aggregate earnings, although further disaggregation 
does not result in more accurate earnings forecasts. 

While these two methods of decomposition seem natural and intuitive from the stand- 
point of financial reporting, they do not fully exploit the information about the relationships 
among earnings components suggested by tbe underlying economics of firms' production 
processes. We propose an alternative approach to earnings decomposition that reflects the 
earnings generating process as the outcome of a more primitive sales-generating process. 
At the operational level, this approach is derived from a management accounting perspective 
that views accounting income as sales revenue net of variable and fixed expenses, with the 
explicit recognition of the relationship between revenue and variable expenses. To the extent 
that variable costs are driven by sales revenue, accounting earnings may be modeled as 
consisting of two components, the first corresponding to contribution margin varying with 
sales, and the second corresponding to fixed costs uncorrelated with sales. The focus on 
sales as the key profit driver allows us to model the earnings process as reflecting the 
underlying economics of the firms' process of converting resources (costs) into outputs 
(sales). By relying on such a conceptual framework that highlights the intercorrelation of 
earnings components, we can incorporate into our forecast model potential asymmetric cost 
behavior related to sales changes. This constitutes a fundamental difference between our 
approach and prior earnings-decomposition-based forecast models. In the next section, we 
discuss in detail how we model such asymmetry (i.e., cost stickiness). 

A similar perspective focusing on the prominent role of sales as the driver of earnings 

“is adopted by Dechow et al. (1998), who model earnings, cash flows, and accruals as 
stochastic processes conditional on a random walk sales-generating process. They argue 
that sales contracts are more primitive than cash receipts and the modeling of the sales 
process should precede the modeling of the earnings and cash flows processes. The fun- 
damental analysis of security valuation that uses signals related to inventory, gross margin, 
receivables, SG&A, order backlog, and labor efficiency that are all benchmarked against 
sales performance (Lev and Thiagarajan 1993) is also consistent with this sales-centric view 
of evaluating economic performance. 


* As discussed in Kothari (2001), one motivation for studying the properties of earnings components is to assess 
their incremental information beyond earnings in their association with stock prices. Earnings forecast models 
based on earnings components are often implicitly assumed in such research contexts, even though some studies 
may not explicitly examine these models of predictive ability (e.g., Lipe 1986). 
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Cost Variability and Cost Stickiness (CVCS) Model 


We consider accounting earnings (Е,) in period г, measured as sales revenue (S,) net of 
costs (С,):6 


E = 8, – С, (1) 


Management accounting textbooks describe two basic types of cost behavior patterns: vari- 
able costs and fixed costs (Garrison and Noreen 2002). Variable costs are defined as those 
that change in proportion to changes in sales volume, whereas fixed costs are characterized 
as those that remain unchanged in a relevant range. From an economic perspective, variable 
costs represent the flexible productive resources that managers can easily adjust in produc- 
ing goods and services, whereas fixed costs represent the committed ‘resources invested to 
provide long-term productive capacity and thus are not expected to change with short-term 
production volume. Such an economic interpretation of different earnings components 
serves as a conceptual basis for modeling the earnings-generating process conditional on 
the relationship between sales and costs. Using this cost model with sales revenue as the 
cost driver, we express total costs in period ¢ as: 


C, = vS, + F, (2) 


where v represents the constant unit variable cost. 
It follows from Equations (1) and (2) that: 


Е, = (1 – 0)8, — Е, (3) 


We assume that 5, follows a first order autoregressive process and allow ће AR(1) 
coefficient to depend on whether sales decrease in period t—1:’ 


S, = ag + 019,1 + Es (4) 


where parameter o, represents the persistence of sales. 

Anderson et al. (2003) document that costs are sticky in that they decrease less with a 
sales decrease tban they increase with a sales increase. Such sticky cost behavior arises if 
trimming off excess resources when demand declines is relatively more costly than scaling 
up resources to accommodate increased demand. Under such conditions, managers’ rational 


5 We use 5, to denote sales in dollars instead of volume, and we interpret у as the fraction of sales representing 
variable costs. Our interpretation is consistent with our empirical forecast model that uses data on sales revenue. 
Since volume data is unavailable, our study is limited to use of revenue data, which confounds price, volume, 
and mix changes. 

7 We are grateful to the referees for suggesting alternative specifications for the sales process that allow the AR(1) 
coefficient to depend on whether sales decrease in period t—1 as in the following two equations: 


5, = ag + 019,1 + а25, D, , + Es 
and AS, = a + a,AS,., + а,А5, 1р, + Es 
where D, , = 1 if sales decline in period /-1, and 0 otherwise. The parameter a, captures the possibly asym- 
metric behavior in sales changes. To address the concern that such asymmetric sales behavior confounds our 
results, we estimated models based on the above two alternative specifications using a pooled sample in 1992— 


2002 and also on a yearly basis. Our results are robust to these alternative specifications, indicating that the 
possibly asymmetric sales changes are not a confounding factor in the forecast performance of our CVCS model. 
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resource adjustment decision entails less reduction of resources when sales decrease than 
addition of resources when sales increase (Anderson et al. 2003). Unlike the static tradi- 
tional cost model that relates costs only to the contemporaneous level of sales volume 
independent of costs and volume in the prior period, cost stickiness exemplifies dynamic 
cost behavior that depends on costs and sales in the prior period and the direction of change 
_-in sales from the prior period. 

We model fixed costs as a first-order autoregressive process specified as: 


F, = Bo + ВЕ + В0(5,_, — S)D, + Er (5) 


where D, = 1 if sales decline in period t, and 0 otherwise. The term В,0(5,_, — $)D, in 
Equation (5) represents the additional amount of costs incurred due to costs being sticky 
when sales decrease in period t. That is, a portion B, of the variable costs in the amount 
of v(S, , — 820, is no longer driven down by a decrease in sales and as a result resembles 
a “fixed” cost. Costs that may exhibit such “sticky” behavior include skilled labor payroll 
costs, advertising and sales promotion costs, and branch operating costs. We recognize 
cost stickiness in the context of our variable and fixed cost model by allowing the level of 
fixed costs, ЕК, to increase when sales decline. 
Substituting S, and F, in Equation (3) with Equations (4) and (5), we have: 


E, = (1— 08, - Е 
= (1 — оба, + аб + Es) — (Bo + ВЕ + vBXS,.; — 5,0, + Ер) 
= (1 — ода + «|$, — Es) — Bo — ВИ — 9)S.., В) 
— оВ>(- + (1 — аб — &)D, — Ер 
= (1 — ој — Bo + ВЕ + (1 — va, — BS — 081 — о), 1р, 
+ vB.D£s5 — Er 
= yo + WD, + Yak + YS + 5,10, + m (6) 


where: 


Yo = (1 — о)о Во 


Yı = 008; 

ү = Ві 

үз = (1 — 0Ха; — By); 
уд = v(a, — ПВ»; and 
т = VBD Es — Ер. 





* Resources that are not mechanically linked to overall activity levels are likely to exhibit sticky behavior. For 
example, the payroll costs of contract employees could be reasonably viewed as variable with total revenue. 
However, in a period of sales downturn perceived to be temporary, managers may choose not to reduce the 
workforce proportionately in order to avoid having to incur additional costs of recruiting and retaining them 
when sales levels pick up in a subsequent period. Payroll costs for such employees retained in excess of the 
levels required for the reduced sales activity resemble “fixed costs" more than “уапаЫе costs" as traditionally 
defined. 
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Unlike E,_,, S, ,, and D, ,, which are all observed at the beginning of period г, the 
sales decrease dummy variable D, needs to be estimated before earnings Е, can be forecasted 
using model (6).? Letting D, denote the estimated value of D, based on information available 
at the beginning of period t, É, the forecasted earnings for period t, is given by: 


E, = Yo + yD, НЕ + Yasi + 45,10, (7) 


The forecast model is expressed in terms of observed earnings and sales from prior years. 
Fixed and variable costs need not to be observed or estimated separately. 


Ш. SAMPLE DATA AND MODEL ESTIMATION 
Data 

Our sample period spans 1988 through 2002.!° АП four time-series forecast models we 
consider require information on earnings or components of earnings based on line items 
from the income statement and the cash flows statement. These annual financial statement 
data are obtained from the Compustat database. We define earnings as income before ex- 
traordinary items (#18) and compute return on owners’ equity (ROE) by dividing earnings 
by beginning book value of equity (#60). We follow Fairfield et al. (1996) to measure the 
operating income (OPINC) and non-operating income (VOPTAX) components of earnings. 
Specifically, OPINC is measured as operating income after depreciation (#178) net of in- 
terest expense (#15), special items (#17), and minority interest (#49). NOPTAX consists of 
non-operating income (#61) net of income taxes (#16). Consistent with the accounting 
accruals literature (Collins and Hribar 2000), we measure the cash flows component (CFO) 
of earnings based on SFAS No. 95 data reported in the cash flows statement (#308 - #124) 
and the accruals component (ACCRUALS) as the difference between earnings and cash 
flows (#18 — #308 + #124).!! Our CVCS model requires, in addition to lagged ROE, 
information about sales revenue (SALES), which we measure as net sales revenue (#12). 
OPINC, NOPTAX, CFO, ACCRUALS, and SALES are all scaled by the beginning-of-year 
owners’ equity. We compute analysts’ consensus forecasts of one-year-ahead ROE compa- 
rable to forecasts generated from the time-series model as the mean analysts’ forecasts of 
earnings per share (EPS) reported on the I/B/E/S database nine months before fiscal year- 
end, divided by beginning-of-year book value of equity per share.'* 

To mitigate the potential adverse effect of data errors and outliers on the estimation of 
the time-series models and on the evaluation of forecast accuracy of these models and 
analysts' forecasts, we restrict our sample to firm-year observations with (1) positive values 
for owners' equity; (2) absolute values of ROE and lagged ROE less than 1; and (3) absolute 





-> The actual value of the sales decrease dummy variable in year г is not available when constructing earnings 
forecast for year t so we cannot use it in any earnings forecast model. 

10 We begin our study period Кот 1988 because SFAS No. 95 data required for computing cash flows from 
continuing operations (CFO) were not available prior to 1988. The data on CFO is used to decompose earnings 
into cash flows and accruals components in the CASHFLOW model (Collins and Hribar 2000). 

!! We require the earnings data reported in the statement of cash flows (#123) to be the same as that reported in 
the income statement (#18) to ensure comparability of cash fiows/accruals decomposition of earnings with the 
operating income/non-orerating income decomposition. 

12 This particular measure af analysts’ forecasts of one-year-ahead ROE is comparable to forecasts generated from 
the time-series models using lagged annual data in the sense that the timing of the consensus forecasts roughly 
coincides with the availability of previous fiscal year's financial statement information. Our method of deriving 
analysts’ forecasts of one-year-ahead ROE from their EPS forecasts follows that of Frankel and Lee (1998). 
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values of net profit margin (i.e., the ratio of earnings to sales revenue) less than 1.? Since 
we estimate the time-series forecast models using industry-level (based on four-digit SIC 
code) random coefficient regressions (discussed in detail in the next subsection), we delete 
firms from each forecast year and the associated estimation period (1.е., the four years 
immediately prior to the forecast year) that are in industries with less than 20 firm-year 
observations available in the estimation period. In order to calculate the forecast errors in 
ROE from all four time-series models and compare their forecast accuracy performance 
simultaneously, we require all sample observations to have no missing values for ROE in 
each forecast year and for the lagged variables: ROE, OPINC, NOPTAX, CFO, ACCRUALS, 
and SALES. 

These sample selection procedures result in a full sample of 39,367 firm-year obser- 
vations from 8,771 firms over the 11-year forecast period 1992-2002. To evaluate forecast 
accuracy of the four time-series models relative to that of analysts, we obtain a subsample 
of 15,500 firm-year observations from 4,334 firms with available analysts’ ROE forecasts. 
Descriptive statistics on ROE and components of ROE for the full sample are reported in 
Table 1. The median (mean) ROE is 0.092 (0.052) and the median (mean) SALES is 2.025 
(3.021). The median (mean) OPINC is 0.109 (0.079), the median (mean) NOPTAX is 
—0.019 (—0.027), the median (mean) CFO is 0.175 (0.174) and the median (mean) AC- 
CRUALS is —0.106 (—0.122). 


TABLE 1 
Descriptive Statistics on Return on Equity (КОЕ) and Components of ROE 
Sample Period: 1992-2002 (n = 39,367) 





запашне Median Mean Sul. Dev. _О1_ 93. 
ROE 0.092 0.052 0.250 -0.022 04173 
SALES 2.025 3.021 5.977 1.067 3.677 
OPINC 0.109 0.079 0.325 —0.045 0.239 
NOPTAX -0.019 -0.027 0.152 -0.077 0.017 
CFO 0.175 0.174 0.517 0.031 0.318 


ACCRUALS —0.106 —0.122 0.505 —0.237 0.000 


ROE = return on equity, measured as net income before extraordinary items (Compustat #18) divided by 
beginning-of-year book value of owners’ equity (#60); 
SALES = net sales revenue (#12) divided by beginning-of-year book value of owners’ equity; 
OPINC = operating income (#178 - #15 + #17 - #49) divided by beginning-of-year book уаз of 


owners’ equity, 
NOPTAX = non-operating income (en net of income taxes (#16) divided by beginning-of-year bcok value 
of owners’ 
CFO - cash flows from continuing operations (#308 — #124) divided by beginning-of-year book value 
of owners' equity; an 
ACCRUALS = total accruals (#18 — вов + #124) divided by beginning-of-year book value of owners’ equity. 


13 Our sample screening procedure consists of two steps. The first step screens each estimation samp.e based on 
all information available up to the end of estimation period (.е., year 1—4 to year t—1 in our case». This step 
(estimation and forecasting) is implementable ex ante since no information beyond year /-1 is used to cull the 
sample. The second step screens the forecast sample by deleting extreme values of ROE in the forecast year 
(i.e., year г) to evaluate the predictive performance of competing forecast models. This evaluation is ex post 
since it requires actual values of ROE in year г. This step helps alleviate concerns about outliers possibly 
inflating the average absolute forecast errors and thus obscuring the differential predictive ability of competing 
forecast models. Our results about the superior predictive ability of our CVCS model hold even when we do 
not employ any screening of the forecast sample. 
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Forecast Model Estimation 

We benchmark the performance of our CVCS model in forecasting ROE against a 
simple model based only on lagged ROE data (the ROE model), the OPINC model based 
on lagged operating and non-operating income disaggregation, and the CASHFLOW model 
based on lagged cash flows and accruals components. Specifically, the four time-series ROE 
forecast models we consider are: 


ROE model: ROE, = Үд + YaROE,, + =, (8) 
OPINC model: ROE, = үө + лы ОРІХС, ү + y, МОРТАХ, | + 5, (9) 
CASHFLOW model: КОЕ, = ул + y4CFO, , + y,ACCRUALS, , + е, (10) 


CVCS model: ROE, = у + Ya DECRDUM, + урКОЕ, | 
+ y,3SALES,_, + "Y¥4,SALES,_;DECRDUM, 
+E (11) 


where DECRDUM, is a sales decrease indicator variable that takes on the value 1 when 
the unscaled sales revenue in period t is less than in period 1—1, and 0 otherwise. 

We employ industry-level (based on four-digit SIC code) random coefficient regressions 
(ВСК) to estimate our CVCS model.'* We allow for stochastic heterogeneity in model 
parameters for firms in each industry. The coefficients for the firms are random variates 
drawn from a separate distribution (mean and variance) for each industry. To illustrate, the 
set of coefficients in Equation (11) is assumed to have the following underlying stochastic 
Structure: 


Ya = Ya + O5 M, i = 0, 1,...,4 


where у2? is the mean (constant) for a given industry in which a particular firm belongs 
and ЗЕМ. МО, сур) is the firm's deviation from the industry mean. Each М follows 
a normal distribution with mean 0 and industry-specific variance оф, Since we use a 
pooled sample, there remains residual unexplained variation e, in ROE, for each firm in any 
period t. Similar stochastic structure also applies to coefficients in the other three times- 
series models (i.e., Equations (8)-(10)), allowing us to estimate those coefficients using 
RCR. We do this to ensure that the four alternative models are evaluated on a comparable 
basis. 

Following Fairfield et al. (1996), we estimate the parameters of all four forecast models 
for each year on a rolling basis using all available observations from the preceding four- 
year period. For example, we use data from 1998—2001 to estimate the forecast model 


14 Firm-specific estimation of our CVCS model is not feasible because the model has seven parameters to be 
estimated and this requires a long time-series of data if firm-level (rather than pooled) estimation is conducted. 
However, the assumption that model parameters (c.g., cost structure) are constant over time is also difficult to 
sustain when a long time-series of data is employed to ensure a reasonable number of degrees of freedom for 
the estimation. Therefore, the compromise we adopt is industry-level estimation with random coefficients for 
firms in each industry. у ! 

15 This rolling estimation approach is also used in Fairfield et al. (1996). Our method differs from theirs in that 
we use industry-level random coefficient regressions based on four-year estimation period, whereas they use 
pooled regressions based on seven-year estimation period. 
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parameters applicable to constructing the ROE forecast for 2002; we obtain a new set of 
parameter estimates using data from 1997—2000 in order to derive the ROE forecast for 
2001, and so on. Since we consider an 11-year forecast period from 1992—2002, we have 
11 corresponding overlapping four-year estimation periods from 1988—2001. This industry- 
level RCR approach produces, for each model, coefficient estimates of the industry mean 
and deviation of each individual firm from its industry mean. The estimate of a firm-specific 
random-effect coefficient is given by the sum of the industry mean and firm-specific de- 
viation.'® We use this firm-specific estimate to compute the ROE forecast during the forecast 
period. To illustrate in algebraic terms, the set of RCR coefficient estimates in Equation 
(11) is expressed as: 


За = YAP + S58 і = 0, 1,...,4 


where 7X is the estimated industry mean and 8м is an individual firm's estimated 
deviation from the industry mean. 

Table 2 reports the means and standard deviations of estimated coefficients of the time- 
series forecast models calculated over the 11 four-year rolling industry-level (based on four- 
digit SIC code) random coefficient regressions.” For ће ROE model, the average slope 
coefficient on ROE, , is 0.513. The average slope coefficient estimates on OPINC, , and 
NOPTAX, , in the OPINC model are 0.485 and 0.352, respectively. The different slope 
coefficient estimates on OPINC, , and NOPTAX, , suggest that decomposing earnings into 
operating and non-operating income may provide incremental predictive power for fore- 
casting one-year-ahead ROE. Similarly, the slope coefficients on the cash flows (CFO,_,) 
and accruals (ACCRUALS, |) components in the CASHFLOW model are different (0.541 
and 0.473, respectively).!? 


IV. FORECAST ACCURACY COMPARISONS 
Main Results 
To forecast ROE with our CVCS model, we require in addition to the parameter esti- 
mates an indicator of whether sales increase or decrease in the forecast year. This estimator 
must be based on information available prior to the forecast period. We estimate the sales 
decrease indicator DECRDUM, using a logit regression of the following form: 


e 


Prob(DECRDUM, = 1) = 1:2 





(12) 


16 In addition to being firm-specific, the gammas in Equations (8)-(11) and (14)-417) are indexed by estimation 
periods because of our rolling estimation method. We omit both firm and time indices in these equations to 
avoid cumbersome notation. 

17 The summary statistics reported in Table 2 are based on estimated coefficients for individual firms retrieved 
from the estimation of the random coefficient regression (RCR) model averaged over 8,771 firms in all 432 
industries and over the 11 estimation periods. The mean and standard deviation reported in Table 2 are the mean 
and standard deviation of those coefficient estimates. 

18 Sloan (1996) estimated a similar regression model of earnings on lagged cash flows and accruals. Using pooled 
estimation, he reported 0.855 and 0.765 as the coefficients on the lagged cash flows and accruals, respectively, 
and using industry-level estimation, he reported 0.781 and 0.721 as the means of these coefficients. Our results 
are consistent with Sloan’s to the extent, that the coefficient estimate on cash flows is greater than that on 
accruals. Sloan’s estimated magnitude of these coefficients may differ from ours because he (1) used a different 
sample period (from 1962 to 1991); (2) scaled earnings, cash flows, and accruals by average total assets; and 
(3) used pooled (or industry-level pooled) regressions. 


The Accounting Review, March 2006 


Predicting Earnings Using a Model Based on Cost Variability and Cost Stickiness 


TABLE 2 


295 


Coefficient Estimates* of Time-Series ROE Forecast Models from 11 Four-Year Rolling 
Industry-Level (four-digit SIC) Random Coefficient Regressions 
Sample Estimation Period: 1988-2001 


Independent Variable 
Intercept 

DECRDUM, 

LOSS 

ROE,_, 

LOSS З КОЕ, 
ОРІМС, | 

МОРТАХ, , 

CFO, 

ACCRUALS,- 
SALES, | 

SALES, , * DECRDUM, 


Average Adj. Е> 


ROE OPINC 
Model Model 
Mean Mean 
Coeff. Est. Coeff. Est. 
(Std. Dev.) (Std. Dev.) 
0.015 0.013 
(0.028) (0.028) 
0.513 
(0.094) 
0.485 
(0.065) 
0.352 
(0.104) 
0.2649 0.2911 


CASHFLOW Asy. ROE 


Model Model 


Coeff. Est. Coeff. Est. 


(Std. Dev.) (Std. Dev.) 
0.007 0.030 
(0.025) (0.021) 
—0.054 
(0.029) 
0.464 
(0.064) 
—0.070 
(0.106) 
0.541 
(0.058) 
0.473 
(0.071) 
0.3030 0.2985 


Coeff. Est. 


0.002 
(0.004) 

—0.008 
(0.008) 
0.3676 


“The numbers reported in each cell are the mean and standard deviation (in parentheses) of firm- and period- 
specific coefficients estimated using the random coefficient regression (RCR) over 8,771 firms in 432 industries 
and over the 11-year estimation period. 

^ Since a goodness-of-fit measure like R? is not well-defined in the context of RCR estimation, the adjusted 
R?s are obtained from estimating the forecasting models using OLS for each four-digit-SIC industry 


and each estimation period. The average is calculated across all industries and estimation periods. 


ROE model: 

OPINC model: 
CASHFLOW model: 
Asy. ROE model: 


CVCS model: 


КОЕ, = Yao + Yas ROE, , + £, 


ROE, = Yso + зы OPINC, -1  Y,;NOPTAX, , + вы 


ROE, = Yeo + СЕО, + Y4ACCRUALS, , + е. 


ROE, = үд + ү.1:055 + у ,ROE, , + y,LOSS*ROE,., + е, 


ROE, = за + за РЕСЕРОМ, + y,ROE,, 
+ YASALES, , + SALES, ,DECRDUM, + €y 


(continued on next page) 
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TABLE 2 (Continued) 


ROE = return on equity, measured as net income before extraordinary items divided by beginning-of-year 
book value of owners’ equity; 
OPINC = operating income divided by the beginning-of-year owners’ equity 
NOPTAX = non-operating income net of income taxes divided by Берон uf. -year book value of owners' 


equity; 
CFO = vu flows from continuing operations divided by beginning-of-year book value of owaers' 


equity; 
ACCRUALS = total accruals divided by beginning-of-year book value of owners’ equity; 
DECRDUM = a dummy variable that takes the value 1 if sales revenue decreases from prior year, an 0 
otherwise; 
LOSS = a dummy variable that takes the value 1 if earnings in the prior year are negative, and 0 
otherwise; and 
SALES - net sales revenue divided by beginning-of-year book value of owners' equity. 


where y = В, + B,AREV, , + є, AREV, is the percentage change in sales revenue from 
year 1—2 to year t—1. We estimate the parameters B, and В, of this logit model or a rolling 
basis for each forecast year using pooled data from the preceding four-year period.!? Table 
3 shows the estimation results of logit regressions. The estimated mean coefficient on 
AREV, , across the 11 estimation periods is B, — —1.865, and the corresponding mean 
odds ratio estimate is 0.157. This result indicates that a one percentage point decrease in 
sales in year t—1 is associated with a six-fold (i.e., 1/0.157 = 6) increase in the odds ratio 


TABLE 3 
Estimation Results of Logit Regression of Sales Decrease Dummy on the Lagged Percentage 
Change in Sales Revenue From 11 Four-Year Rolling Regressions 
Sample Estimation Period: 1988—2001 


ехр(В, + B,AREV,_, + £) 
1 + ехр(В, + B,AREV, , + 22 


Panel A: Coefficient Estimates and Odds Ratio Estimates 


Logit Model: DECRDUM, = 


Mean Mean Mean Lower Mean Upper 
Independent Coeff. Est. Odds Ratio Confid. Limit Confid. Limit 
Variable (Std. Dev.) (Std. Dev.) (Std. Dev.) (Std. Dev.) 
Intercept | —0.743 
(0.144) 
AREV, —1.865 0.157 0.133 0.185 
(0.155) (0.024) (0.020) (0.029) 
Panel B: Measures of Association 
Mean Percent Concordant Mean Percent Discordant Mean Percent Tied 
(Std. Dev.) (Std. Dev.) (Std. Dev.) 
64.190 35.080 0.730 
(0.717) (0.704) (0.053) 


DECRDUM, = a dummy variable that takes the value 1 if sales revenue decreases in year г, and 0 otherwise; 
and 


ARET, , = percentage change in sales revenue from year t—2 to year t—1. 


19 To mitigate concerns for undue influence of potential outliers, we deleted observations with absolcte value of 
AREV,_, greater than 1 in each round of estimation. 
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of sales decline in year t. The high concordant measure of 64 percent indicates that this 
simple and parsimonious logit model (12) performs reasonably well in predicting the di- 
rection of change in sales revenue. 

We determine the estimated value of the sales decrease dummy variable 
(DECRDUM, = Й.) for the forecast period as: 


1, if By + B,AREV,_, > 0 


DECRDUM, = | бо + BAREV < 0 


(13) 


Our classification scheme based on the zero cutoff in Equation (13) is interpreted as a 
simple heuristic that predicts a sales decline (DECRDUM = 1) if the estimated probability 
is greater than 50 percent. 

The forecast of КОЕ from our CVCS model is then given by: 


ROE, = 44 + За РЕСЕБОМ, + $4ROE, | + 44SALES, | 
| + 44SALES, ,DECRDUM, (14) 


where, Yap, Чар ја are the five estimated coefficients of the CVCS model in Equation 
(11) 


The ROE forecasts from the ROE model, the OPINC model, and the CASHFLOW . 
model, respectively, are as follows: 


ROE, = Yao + 44, КОЕ, , (15) 
ROE, = 4,9 + 3,,0PINC, , + %„МОРТАХ,_, (16) 
ROE, = З. + За CFO, + 3$; ACCRUALS, | (17) 


where ‘4s in the above expressions are the estimated coefficients of their corresponding 
models in Equations (8), (9), and (10). 

Based on the estimated parameters of the four forecast models and the estimated di- 
rection of change in sales revenue, we obtain four forecasts of ROE in forecast period 1 
(ROE,) as given previously in Equations (14)-(17). The forecast error (FE) is defined as 
the difference between the actual realization of ROE in year t and the forecasted КОЁ, 


FE, = ROE, — ROE, (18) 

The absolute forecast error (AFE) is the absolute value of the forecast error FE: 
AFE, = |ЕЕ | (19) 
The statistics on the distribution of AFE for the time-series models are shown in Panel 
A of Table 4. The CVCS model generates AFE with the smallest median (0.0734), mean 
(0.1412), first quartile (0.0291), and third quartile (0.1779) of the four time-series models 
considered. In contrast, the basic ROE model produces AFE with the largest median 
(0.0805), mean (0.1427), first quartile (0.0340), and third quartile (0.1790). The OPINC 
and CASHFLOW models represent intermediate cases in terms of their forecast accuracy 
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TABLE 4 
ROE Forecast Accuracy Comparisons among Time-Series Forecast Models Estimated From 
11 Four-Year Rolling Industry-Level (four-digit SIC) Random Coefficient Regressions 
Sample Forecast Period: 1992-2002 (n = 39,367) 


Panel A: Distributions of Absolute Forecast Errors (AFE) 


Forecast Model Median Mean Std. Dev. EUM 93. 
ROE 0.0805 0.1427 0.1735 0.0340 0.1790 
OPINC 0.0797 0.1422 0.1736 0.0334 0.1784 
CASHFLOW 0.0797 0.1420 0.1740 0.0337 0.1780 
Asy. ROE 0.0791 0.1423 0.1738 0.0328 0.1784 
CVCS 0.0734 0.1412 0.1788 0.0291 0.1779 
Panel B: Distributions of Pair-Wise Differences in Absolute Forecast Errors (AFE)* 
Comparison Median Mean Std. Dev. of 
Base Model _ Мова | Difference Difference Difference 
ROE OPINC 0.0002*** 0.0005*** 0.0215 
ROE CASHFLOW 0.0005 *** 0.0006* ** 0.0370 
ROE CVCS 0.0052*** 0.0010*** 0.0446 
OPINC CASHFLOW 0.0001 0.0001 0.0400 
OPINC CVCS 0.0035*** 0.0009+++ 0.0431 
CASHFLOW CVCS 0.0031*** 0.0008*** 0.0493 


хээв, же, ж indicate significance at the 1 percent, 5 percent, and 10 percent levels, respectively, using Wilcoxon 
signed rank test. 

+++ ++, + indicate significance at the 1 percent, 5 percent, and 10 percent levels, respectively, using the t-test. 
* Pair-wise difference in AFE = AFE from the base model — AFE from the comparison model. 


measured using distributions of AFE. The forecast accuracy improvement of the CVCS 
model relative to the benchmark ROE model in terms of reduction in median AFE is 0.0071 
(0.0805 — 0.0734), which is about nine times as large as the improvement of ће OPINC 
or CASHFLOW models relative to the ROE model (reduction in median AFE = 0.0008). 

To provide a statistical test of the forecast improvement, following Fairfield et al. 
(1996), we employ the nonparametric Wilcoxon signed rank test to evaluate the paired 
differences in AFE between two forecast models without imposing restrictive assumptions 
about the exact form of the underlying distribution (Lehmann 1975; Sheskin 1997).” Panel 
B of Table 4 presents pair-wise comparisons of AFE among the four models based on the 
distribution of the difference in AFE between any two competing models. Included in each 
row are the median, mean, and standard deviation of the difference in AFE between a 
comparison model and a base model. Positive (negative) values of the difference in AFE 
indicate that the comparison model generates a lower (higher) level of AFE and thus is 
more (less) accurate in forecasting one-year-ahead ROE, as compared to the base model. 
All the results based on pair-wise comparisons are consistent with those based on compar- 
isons at the median or mean AFE in Panel A of Table 4. The median reduction in AFE of 





20 Since the hold-out samples used in comparing any two competing forecast models consist of the same set of 
firm-year observations, the appropriate statistical test of relative forecast accuracy is based on a matched sample 
comparison of AFEs. 
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the OPINC model relative to the ROE model is 0.0002 (p « 0.001)?! The CASHFLOW 
model also provides more accurate forecasts than the ROE model, with a median improve- 
ment of 0.0005 (p < 0.001). In contrast to the OPINC and CASHFLOW models that exhibit 
modest forecast accuracy improvement over the ROE model, the CVCS model improves 
forecast accuracy substantially. The median reductions in the AFE of CVCS relative to the 
ROE, OPINC, and CASHFLOW models are 0.0052 (p < 0.001), 0:0035 (p < 0.001), and 
0.0031 (p < 0.001), respectively. Examining the mean differences in AFE produces qual- 
itatively similar results.” 

The Table 4 evidence confirms the previous result that disaggregating earnings into its 
operating and non-operating components leads to more accurate forecasting of one-year- 
ahead ROE. It also demonstrates that decomposing earnings into cash flows and accruals 
has a similar impact on ROE forecast accuracy. More importantly, decomposing earnings 
based on cost variability and cost stickiness provides substantial incremental predictive 
ability in forecasting future ROE, over and above the forecast improvement of earnings 
decomposition based on financial statement line items that do not recognize the relation 
between sales and costs. 

The Table 5 results are based on a subsample of firms containing analysts’ consensus 
forecasts. Using the Frankel and Lee (1998) procedure to convert earnings per share (EPS) 
forecasts to ROE forecasts, we calculate the analysts’ AFE in ROE based on their EPS 
forecasts reported on the I/B/E/S database. The statistics on the AFE distributions of 
analysts’ forecasts and time-series model forecasts are presented in Panel A of Table 5. 
Consistent with evidence of analyst earnings forecast superiority vis-à-vis time-series mod- 
els (Brown 1993), the analysts’ consensus forecasts are the most accurate, with median 
(mean) AFE. of 0.0457 (0.0976), which is 59 percent (80 percent) of the median (mean) 
AFE from the benchmark ROE model (median AFE = 0.0775, mean AFE = 0.1215). 
Consistent with the results in Table 4, the CVCS model is the most accurate of the three 
time-series models based on lagged earnings components, with median (mean) AFE of 
0.0675 (0.1168). The OPINC and the CASHFLOW models have very similar forecasting 
performance, with median (mean) AFE of 0.759 (0.1205) and 0.760 (0.1206), respectively. 

Panel B of Table 5 reports the results of comparing forecast accuracy based on pair- 
wise differences in AFE. The median reductions in matched AFE from the OPINC and the 
CASHFLOW models relative to the ROE model are 0.0005 and 0.0006, respectively, which 
is less than 3 percent of the median reduction in matched AFE from I/B/E/S consensus 
forecasts (0.0206). In contrast, the median reduction in matched AFE from our CVCS model 
is 0.0113, over half (55 percent) of that attributable to analysts’ consensus forecasts. Similar 
to the Table 4 findings, these results indicate that the OPINC and CASHFLOW models are 
almost indistinguishable, with median difference of —0.0002 and mean difference of 0.0001. 
Both differences are statistically significant at 1 percent. In addition, I/B/E/S forecasts 
beat both the OPINC and CASHFLOW models and, to a lesser extent, the CVCS model, 
in pair-wise comparisons of forecast accuracy. 

To assess the significance of forecast improvement of the three models of earnings 
decomposition, we calculate the reduction in median and mean AFE from the OPINC, 


?! Fairfield et al. (1996) reported a median reduction of 0.0008 in absolute forecast error using the OPINC model 
relative to the ROE model. 

2 We conducted a t-test for the equality of the means of two related samples. The t-statistics are all significant at 
the 0.001 level, indicating that both the OPINC and the CASHFLOW models perform better than the ROE 
model but worse than the CVCS model. The two-sample t-test requires the assumption of normal distribution 
in both samples. Since the normality assumption is found not to be descriptive of the distributions of AFE, we 
view the t-test results as suggestive, not conclusive. 
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TABLE 5 
ROE Forecast Accuracy Comparison between Time-Series Forecast Models and I/B/E/S 
Consensus Forecasts* 
Sample Forecast Period: 1992—2002 (п = 15,500) 
Panel A: Distributions of Absolute Forecast Errors (AFE) 
Forecast Model Median Mean Std. Dev. 01. _93 
ROE 0.0775 0.1215 0.1364 0.0364 0.1511 
OPINC 0.0759 0.1205 0.1370 0.0359 0.1490 
CASHFLOW 0.0760 0.1206 0.1374 0.0354 0.1499 
Asy. ROE 0.0747 0.1202 0.1369 0.0345 0.1511 
CVCS 0.0675 0.1168 0.1412 0.0289 0.1454 
I/B/E/S 0.0457 0.0976 0.1388 0.0169 0.1156 
Panel B: Distributions of Pair-Wise Differences in Absolute Forecast Errors (АЕЕУ 
Comparison Median Mean Std. Dev. of 

Base Model Model Difference Difference Difference 
ROE OPINC 0.0005 ***. 0.0008 ** 0.0183 
ROE CASHFLOW 0.0006*** 0.0009* ** 0.0309 
ROE CVCS 0.0113*** 0.0047+++ 0.0354 
КОЕ I/B/E/S 0.0206*** 0.0239+++ 0.1148 
OPINC CASHFLOW —0.0002 0.0001 0.0335 
OPINC CVCS 0.0075*** 0.0037* ** 0.0387 
OPINC I/B/E/S 0.0190*** 0.0229* ** 0.1153 
CASHFLOW CVCS 0.0075*** 0.0038*** 0.0437 
CASHFLOW I/B/E/S 0.0230*** 0.0230+++ 0.1158 
CVCS I/B/E/S 0.0105*** 0.0192* ** 0.1049 


Panel C: Forecast Improvement over the ROE Моде» 


Accuracy Improvement 
over the ROE Model 


% of Accuracy Improvement 
of I/B/E/S over the ROE Model 


Difference in Difference in Difference in Difference in 
Forecast Model Median AFE Mean AFE Median AFE Mean AFE 
OPINC 0.0016 0.0010 5.0 42 
CASHFLOW 0.0015 0.0009 ` 4.7 3.8 
CVCS 0.0100 0.0047 31.4 19.7 
I/B/E/S 0.0318 0.0239 100.0 100.0 


эээ, же, * indicate significance at the 1 percent, 5 percent, and 10 percent levels, respectively, using Wilcoxon 
signed rank test. 

+++, ++, * indicate significance at the 1 percent, 5 percent, and 10 percent levels, respectively, using the t-test. 
* ROE forecasts calculated based on analysts’ consensus forecasts of earnings per share reported on I/B/E/S 
database nine months before the fiscal year end. 

5 Forecast improvement is measured as the difference in median or mean absalute forecast errors between 

two forecast methods. 

° Pair-wise difference in AFE = AFE from the base model — AFE from the comparison model. 


CASHFLOW, and CVCS models, respectively, as a percentage of the reduction in AFE of 
analysts’ forecasts, all benchmarked against the ROE model. The differential forecast ac- 
curacy between analysts’ forecasts and the ROE model reflects the total “information gap” 
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existing between the basic time-series model that uses only aggregate historical earnings 
information and financial analysts who utilize a significantly richer set of information, 
possibly including information contained in earnings components, in making earnings fore- 
casts. The results are shown in Panel C of Table 5. The CVCS model improves forecast 
accuracy in median АРЕ by 0.0100 relative to the ROE model, representing 31.4 percent 
of the total improvement of analyst consensus forecasts over the ROE model (reduction in 
median AFE = 0.318). In contrast, the corresponding figures for the OPINC and the CASH- 
FLOW models are only 5.0 percent and 4.7 percent, respectively. 

Taken together, these results in Table 5 indicate that earnings decomposition that rec- 
ognizes cost variability and cost stickiness provides substantial incremental predictive con- 
tent in forecasting one-year-ahead ROE relative to earnings decomposition that ignores such 
properties of earnings components. The evidence on the forecasting performance of our 
CVCS model relative to the more accurate analysts’ consensus forecasts suggests a prom- 
ising line of inquiry to examine if cost behavior information is a source of financial analysts’ 
advantage over time-series models. 


Loss Firms 


Prior work in the accounting literature demonstrates that losses have lower earnings 
response coefficient (ERC) than profits, suggesting that the former is not as persistent as 
the latter due to the liquidation option (Hayn 1995). Our CVCS model contains a term that 
captures the differential impact of sales increase and decrease on future earnings. To the 
extent that the direction of sales changes is correlated with the sign of earnings, the observed 
superior forecasting performance of our CVCS model may be confounded.” To test this 
possibility, we augment the simple baseline ROE model (Equation (8)) to allow for differ- 
ential persistence of profits and losses. This asymmetric ROE model is specified as follows: 


ROE, = Yeo + y,,LOSS + y,4ROE, , + y,LOSS*ROE, , + Е, (87 
_ Ji, if ROE,., «0 
where LOSS = (А if ROE, > 0 


We expect the revised ROE model in Equation (8’) to produce more accurate earnings 
forecasts than the simple ROE model in Equation (8) that does not distinguish between 
positive and negative earnings. More importantly, we expect the CVCS model to perform 
better than the asymmetric ROE model if our CVCS model captures an aspect of the 
earnings process beyond the liquidation option. 

In estimating the asymmetric ROE model in Equation (8") and forecasting one-year- 
ahead ROE, we follow the procedure described in the section titled “Forecast Model Es- 
timation” to provide earnings forecasts on a comparable basis. Descriptive statistics on 
absolute forecast errors from the asymmetric ROE model are reported in Panel A of Tables 
4 and 5. As expected, the asymmetric ROE model generates more accurate earnings fore- 
casts than the simple ROE model. In Panel A, Table 4, for the full sample, the median 
(mean) AFE of the asymmetric ROE model is 0.0791 (0.1423), while the median (mean) 
AFE of the simple ROE model is 0.0805 (0.1427). Results reported in Panel A of Table 5 
for the subsample of observations with available I/B/E/S forecasts are quantitatively sim- 
ilar to those in Table 4: the median (mean) AFE of the asymmetric ROE model is 0.0747 


23 We are grateful to an anonymous referee for suggesting we examine the possibly confounding effect of differ- 
ential persistence of profits and losses. 
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(0.1202), while the median (mean) AFE of the simple ROE model is 0.0775 (0.1215). Such 
improvement in forecasting accuracy, however, does not fully explain the superior perform- 
ance of our CVCS model: the median (mean) AFE of the CVCS model is 0.0734 (0.1412) 
for the full sample and 0.0675 (0.1168) for the I/B/E/S subsample. These results suggest 
that by incorporating asymmetric cost behavior, our CVCS model captures an aspect of the 
earnings process that is not a mere manifestation of the liquidation option. 


V. ASSOCIATION WITH ABNORMAL STOCK RETURNS 

In the previous two sections, we focus on forecast accuracy when comparing our CVCS 
model with the ROE, OPINC, and CASHFLOW models. Another performance criterion 
widely used in capital market research when evaluating earnings forecast models is based 
on the association of earnings forecast errors (i.e., unexpected earnings conditioned on a 
forecast model) with contemporaneous stock returns (Fried and Givoly 1982; Brown et al. 
1987b; O'Brien 1988). This performance criterion reflects the informativeness (or infor- 
mation content) of an earnings forecast model in capturing investors' earnings expectations. 
Under our maintained assumption that the market's expectation is informationally efficient, 
the closer a selected model comes to mimicking the market's expectations, the less noise 
there is in forecasts from the model in representing unobservable market expectations, and 
thus the higher the association we expect to observe between the forecast errors from 
the model and the contemporaneous abnormal stock returns (Foster 1977). Therefore, the 
strength of the association between forecast errors and abnormal returns provides a yardstick 
to measure the relative information content of alternative earnings forecast models. 

Prior research finds that forecast models with superior accuracy are not necessarily 
superior on the association dimension (Hughes and Ricks 1987; O'Brien 1988) so we 
evaluate the performance of the CVCS model based on the association between its forecast 
errors and contemporaneous stock returns, and compare it with the associations using the 
ROE, OPINC, CASHFLOW, and analyst models. We follow Dechow (1994) in evaluating 
the relative information content by using the likelihood ratio test proposed by Vuong (1989) 
for the following five simple regression models: 


RET, = a + а FER + Ем (20) 
RET, = ay, + Ap FELPE + e, (21) 
RET, = a, + a, ,FEPASPFLOW + Б, (22) 
RET, = ар + ap EVS + Ба (23) 
КЕТ, = ад + a БЕЗ + e, (24) 


where RET, is the size- and book-to-market-adjusted contemporaneous annual stock returns 
(Fama and French 1992), and FERO#, FEOPINC, pEcASHFLOW FECVCS and РЕВЕ are the 
forecast errors based on the ROE, the OPINC, the CASHFLOW, the CVCS models and 
I/B/E/S analyst consensus forecasts, respectively. The advantage of using Vuong's (1989) 
test is that it provides directional inferences as to which of two competing models is closer 
to the true underlying model without assuming that one model is the true one. The Vuong 
test is particularly suitable when the models being compared have significant explanatory 
power, as is true in our case (Vuong 1989; Dechow 1994). 
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We obtain information on the portfolio returns formed conditioned on size (market 
value of equity) and the book-to-market ratio to compute abnormal returns. The sample we 
use for the market association analysis is the intersection of the sample firms used above 
to compare earnings forecasts, and the CRSP database from which we obtain data on stock 
returns. We omit firms without December 31 fiscal year-ends and firm-year observations 
whose absolute abnormal stock returns exceed 100 percent. We also exclude observations 
with the most extreme 1 percent of forecast errors of each model. Our final sample is 8,745 
firm-years from 2,747 firms from 1992 to 2002.4 

Table 6 presents the correlation coefficients between forecast errors and abnormal stock 
returns. The Pearson (Spearman) correlations are above (below) the diagonal. The CVCS 
model produces forecast errors that are the most highly correlated with abnormal returns 
among the four time-series models we consider (Pearson p = 0.350, Spearman p = 0.375), 
while I/B/E/S analyst forecast errors are more highly correlated with abnormal returns 
(Pearson p = 0.387, Spearman p = 0.459) than any of the time-series model. The fact that 
the four time-series models generate highly correlated forecast errors is not surprising since 
they are conditioned on similar information sets, but the superior performance of the CVCS 
model relative to the other three time-series models obtains despite the near-unity corre- 
lations among them. 

Table 7 contains results of the test of relative information content. Panel A reports 
regression results on the association between abnormal stock returns and forecast errors 


TABLE 6 
Correlation Coefficients* (p-values in parentheses) between Abnormal Stock Returns (RET)? 
and Forecast Errors (FE): in Returns on Equity (ROE) Based on Four Time-Series Forecast 
Models and I/B/E/S Consensus Forecasts 
Sample Period: 1992-2002 (п = 8,745) 














RET ЕЕВОВ FROPINC FECASBFLOW ЕЕСҮСЗ FETS 
RET 0.342 0.338 0.331 0.350 0.387 
(0.000) (0.000) (0.000) (0.000) (0.000) 
FEROE 0.362 0.993 0.978 0.992 0.702 
(0.000) (0.000) (0.000) (0.000) (0.000) 
FEOPINC 0.360 0.990 0.975 0.985 0.701 
(0.000) (0.000) (0.000) (0.000) (0.000) 
ЕЕСАЗНЕТОМ 0.348 0.971 0.965 0.973 0.697 
(0.000) (0.000) (0.000) (0.000) (0.000) 
FESS 0.375 0.984 0.975 0.959 0.723 
(0.000) (0.000) (0.000) (0.000) (0.000) 
FEES 0.459 0.652 0.652 0.641 0.675 
(0.000) (0.000) (0.000) (0.000) (0.000) 


* Pearson correlation coefficients are above the diagonal, and Spearman correlation coefficients are below the 
diagonal. 

* Abnormal stock returns are measured as size and book-to-market adjusted returns, or annual raw returns less 
portfolio returns matched on size (market value of equity) and book-to-market (the ratio of book value of 
equity to market value of equity). 

° РЕКЕ, FROPINC, PECASHFLOW FECVCS and FE®® are forecast errors based on ће КОВ, OPINC, CASHFLOW, 
CVCS models, and I/B/E/S consensus analysts’ forecasts, respectively. 


24 Results are similar to those reported here for the relative performance of the four time-series models when we 
use a larger sample of 17,170 firm-years that includes observations for which 1/В/Е/5 data are not available. 
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TABLE 7 
Regression Results of Abnormal Returns* on Forecast Errors (ЕК)? and Results of Vaong’s 
Test of Relative Information Content of Four Time-Series Forecast Models and I/B/E/S 
Consensus Forecasts 
Sample Period: 1992—2002 (n = 8,745) 


Panel A: Simple Regression of Abnormal Returns on Forecast Errors (FE) 
Independent Coeff. Est. Coeff. Est. Coeff. Est. Coeff. Est. Coeff. Est. 


Variable (t-statistic) (t-statistic) (t-statistic) (t-statistic) (t-statistic) 
Intercept —0.002 —0.001 —0.001 0.013 0.025 

(—0.60) (—0.26) (—0.18) (1.22) (4.71) 
ЕЕЖОЕ 0.829 

(34.03) 
FEOPINC 0.818 

(33.53) 
FECASHFLOW 0.803 
(32.76) 
FEYS 0.845 
(34.95) 
FEBES 1.020 
(39.21) 

Adj. R? 0.1169 0.1139 0.1093 0.1225 0.1495 
Panel B: Results of Vuong's Test of Relative Information Content 
Competing Forecast Models Vuong's Z-statistic" p-value 
ROE vs. OPINC -322 0.002 
ROE vs. CASHFLOW —4.66 0.000 
ROE vs. CVCS 5.45 0.000 
ROE vs. I/B/E/S 4.76 0.000 
OPINC vs. CASHFLOW —2.62 0.009 
OPINC vs. CVCS 6.29 0.000 
OPINC vs. I/B/E/S 5.22 0.000 
CASHFLOW vs. CVCS 7.12 0.000 
CASHFLOW vs. I/B/E/S 5.85 0.000 
CVCS vs. I/B/E/S 4.04 0.000 


* Size and book-to-market adjusted returns equal annual raw returns less portfolio returns matched on size 
(market value of equity) and book-to-market (the ratio of book value of equity to market value of equity). 

b FE®OE, руорімс, FECASHFLOW FECVCS and РЕВЕ are forecast errors based on ће ROE, OPINC, CASHFLOW, 
CVCS models, and I/B/E/S consensus analysts' forecasts, respectively. 

* Vuong's Z-statistic is based on the likelihood ratio test developed by Vuong (1989) for non-nested model 
selection; a significant positive (negative) Z-statistic indicates that the first (second) forecast model is rejected 
in favor of the other forecast model. 


in return on equity based on various forecast models. The highest adjusted R? of 0.1495 
is achieved when abnormal returns are regressed on I/B/E/S analyst forecast errors, fol- 
lowed by forecast errors based on the CVCS model (adjusted R? — 0.1225). In Panel B, 
results based on pair-wise comparison indicate that the relative information content of the 
I/B/E/S analyst forecast is significantly greater than that of any of the mechanical forecast 
models. However, among the four time-series models we consider, Vuong's test rejects all 
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three benchmark models when each of them is compared against the CVCS model. Spe- 
cifically, Vuong’s Z-statistic is 5.45 (p < 0.001) for the comparison between the ROE model 
and the CVCS model, 6.29 (p < 0.001) for the comparison between the OPINC model and 
the CVCS model, and 7.12 (p < 0.001) for the comparison between the CASHFLOW 
model and the CVCS model, all in favor of the CVCS model. In sum, these findings are 
consistent with the CVCS model having greater relative information content in measuring 
market’s earnings expectations than other time-series forecast models that do not exploit 
the information in the relation between sales and cost items in the financial statements. 


VI. CONCLUDING REMARKS 

We propose an earnings forecast model (CVCS) that recognizes cost variability with 
sales changes and cost stickiness when sales decline. Unlike models of disaggregated earn- 
ings based on reporting classifications, our model stems from a management accounting 
perspective that emphasizes interactions among earnings components. The core of our 
model is the relation between sales and expenses embodied in the traditional fixed and 
variable cost behavior model. In addition, CVCS incorporates recent research that docu- 
ments sticky cost behavior—the fact that costs do not decrease as much with sales decreases 
as they increase with sales increases. 

Our CVCS model predicts one-year-ahead returns on equity better than do other models 
based on line items reported in income statements and statements of cash flows. Our CVCS 
model bridges over 30 percent of the information advantage financial analysts possess in 
forecasting one-year-ahead ROE over a basic time-series model that uses information on 
historical aggregate earnings. In contrast, the OPINC and CASHFLOW models bridge less 
than 6 percent of analysts’ superior forecast accuracy over the basic ROE model. Invoking 
the assumption of market efficiency, we find that our model performs better than the other 
three time-series models when evaluated based on the association between the earnings 
surprise measured by forecast errors relative to these models and contemporaneous abnor- 
mal stock returns. Thus, the evidence suggests that earnings forecasts using our CVCS 
model better represent the market’s earnings expectations than those using other models 
based on financial statement line items. 

Our study raises several interesting questions to be explored in future research. As our 
association analysis is predicated on the maintained assumption of market efficiency, we 
do not address the issue of whether the stock market fully captures the information content 
of cost variability and cost stickiness relevant for forecasting future earnings. Although cost 
variability is a concept that is generally well understood, evidence on cost stickiness was 
largely anecdotal until recently. The extent to which investors have rational expectations 
about the impact of cost stickiness on future earnings over a longer horizon remains an 
open question. 

Our objective is to consider a parsimonious model based only on income statement 
data. Promising research opportunities include expanding the scope of our model to incor- 
porate additional information such as new product introduction or plant closures available 
to investors that may impact the earnings forecasts. In addition, the manner in which we 
model the impact of sticky cost behavior could be extended to consider its impact on the 
amount of accruals and related balance sheet items in order to develop a better understand- 
ing of the valuation process. 

Our study documents that the simple cost variability and cost stickiness model has 
predictive content for the analysis of future profitability. Since such cost behavior models 
underlie some management accounting systems for budgeting and variance analysis, this 
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documentation is important in validating the basic premise supporting their design and 
indicating the importance of incorporating sticky cost behavior in improving their design. 
By emphasizing the importance of understanding cost behavior in forecasting earnings, our 
study points to the rich potential for future research integrating heretofore disparate streams 
of work in managerial and financial accounting. 
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ABSTRACT: Complex financial statement misstatements that are difficult to diagnose 
are likely to be described by multiple cues that individually appear innocuous, but that 
together form an ominous pattern. Because individual auditors are likely to obtain only 
some of the cues forming the pattern, it is important to understand how well auditors 
Interpret Incomplete cue patterns. In this paper, | examine experimentally whether 
Industry-specialist auditors develop problem representations about a seeded misstate- 
ment to facilitate Interpretation of Incomplete patterns. Overall, | find that matched 
specialists (.е., those working in their Industry) are able to Interpret and fill in partial- 
cue patterns, whereas mismatched specialists do not recognize the Implications of 
even full patterns. Matched’ specialists respond to a partial-cue pattern that potentially 
indicates misstatement by developing more complete problem representations about 
the seeded misstatement, assessing higher risk of misstatement, and suggesting pro- 
cedures that will be efficient and effective at discriminating the presence of the seeded 
misstatement. Despite higher assessments of misstatement risk when recelving a 
partial-cue pattern, mismatched specialists’ problem representations are impoverished 
and suggested procedures do not indicate a focus on the seeded misstatement. 


Keywords: auditor knowledge; industry specialization; pattem recognition; problem 
representations. 
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I. INTRODUCTION 

uditors’ failure to identify and report complex financial statement misstatements has 
Ам severe consequences for financial statement users and for auditors themselves. 

Financial statement misstatements that are complex! are hard to diagnose because 
they are likely to be described by a pattern of individually innocuous cues? The problem 
of diagnosing such misstatements is further complicated when the pattern is incomplete. In 
this study, I examine whether industry-specialist auditors’ unique knowledge structures al- 
low them to diagnose such misstatements. Specifically, I investigate whether specialists use 
problem representations about the misstatement contained in incomplete industry-specific 
patterns to identify the increased risk present and to direct the search for evidence about 
the misstatement. 

It is important to understand the determinants of how well auditors interpret incomplete 
patterns suggestive of misstatement. Auditors must make judgments and decisions during 
the planning and evidence collection stages of the audit with incomplete information. Ad- 
ditionally, the division of labor among audit team members suggests the possibility that 
different auditors will collect the pieces of information that form a pattern. Individual 
auditors, who likely have seen only part of the pattern, would not have enough information, 
on their own, to identify the potential increased risk of misstatement. Consequently, unless 
at least one team member receives all of the information forming the pattern, the risk of 
misstatement may be assessed too low or identification of a misstatement may be delayed, 
negatively impacting both audit effectiveness and efficiency. Moreover, because individual 
pieces of information are not, in themselves, very suggestive of misstatement, they are 
unlikely to be communicated.* 

I examine experimentally whether differences in industry specialists’ problem repre- 
sentations about a seeded misstatement lead to task-performance differences. Industry- 
specialist auditors complete two cases, one matching and one not matching (as a bench- 
mark) their industries of specialization. I manipulate whether they receive full, partial, or 
no patterns diagnostic of a misstatement embedded in case materials between participants. 
Auditors judge the likelihood of misstatement, explain the misstatement(s) about which 
they are concerned, determine necessary additional audit procedures, and perform surprise 
recalls for each case. 

Consistent with expectations, I find that matched specialists develop more complete 
problem representations about the seeded misstatement when they receive partial- or full- 
cue patterns than when they receive no-cue patterns. Additionally, matched specialists who 
receive partial- and full-cue patterns assess the likelihood of misstatement higher than 
matched specialists who receive no-cue patterns. Critically, a mediation analysis confirms 


Complexity denotes the number of alternatives or attributes that a decision maker has to consider. More alter- 
natives or attributes are associated with more complex decisions (Payne et al. 1993). 
? A pattern is composed of multiple pieces of information that must be combined, like the individual pieces of a 
mosaic, for the misstatement to emerge. When a pattern exists, no single piece of information is informative 
enough, on its own, to identify the misstatement. Rather, all of the cues forming the pattern must be perceived 
and appropriately processed for the misstatement to be conclusively identified. 
Problem representations are cognitive representations of complex phenomena that can be used to “run” thought 
experiments that allow inferences to be made (Christ 1993; Greeno 1989). 
* The review process will result in formalized information sharing; however, evidence from court documents 
suggests that the review process does not always result in complete information sharing. For example, testimony 
from Lincoln Savings and Loan reveals that for the Continental Ranch land sale the manager in charge of the 
audit of AMCOR (the subsidiary recording the sale) was not aware that Lincoln had loaned the down payment 
to the buyer. This condition should have prohibited AMCOR from recording the $9 million gain on sale. Lincoln 
and AMCOR had separate audit teams; only the partners on the engagement team had responsibility for both 
entities (Erickson et al. 1996). 
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the role played by problem representations for matched specialists. Specifically, problem 
representation completeness significantly mediates the influence of the pattern manipulation 
on matched specialists’ likelihood assessments. Matched specialists respond to a partial-cue 
pattern that potentially indicates misstatement by developing more complete problem rep- 
resentations about the seeded misstatement, assessing higher risk of misstatement, and sug- 
gesting procedures that will be efficient and effective at discriminating the presence of the 
seeded misstatement. 

In contrast, mismatched specialists generally do not develop more complete problem 
representations about the seeded misstatement, even when they receive full-cue patterns. 
Additionally, despite assessing higher misstatement risk when receiving a partial-cue pattern 
versus no-cue pattern, mismatched specialists’ suggested procedures do not focus on the 
seeded misstatement. 

Collectively, these results suggest that industry specialists will improve audit quality 
when industry-specific knowledge is necessary to identify a misstatement and when knowl- 
edge of general accounting and audit testing, and thinking by analogy are not adequate 
substitutes for this knowledge. I document that one situation for which such industry- 
specialist knowledge is necessary is misstatement identification when a partial error pattern 
is present. To my knowledge, this is the first рарег to investigate auditors’ ability to identify 
partial patterns diagnostic of misstatement. This is a skill that is critical for successfully 
identifying complex financial statement misstatements, especially those being purposefully 
concealed. 

The rest of this paper is organized as follows. The next section contains a review of 
the relevant literature and hypothesis development. In Section III, I describe the method 
that I use to test the hypotheses. In Section IV, I describe the experimental results. Finally, 
I discuss contributions and limitations. 


П. LITERATURE REVIEW AND HYPOTHESIS DEVELOPMENT 
Pattern Recognition 

The ability to process patterns or configurations of stimuli that are important to sub- 
sequent judgments or decisions is known as configural information processing. Auditing 
researchers have reported that auditors configurally process information provided that all 
cues are present and domain-specific knowledge implies such a configuration is appropriate 
(Brown and Solomon 1990, 1991). Under these stylized conditions, auditors configurally 
process information about control-risk assessment, audit-procedure evidence evaluation, and 
financial statement account comparison analysis. Auditors also process information config- 
urally when making estimates of extent of audit testing (Kerr and Ward 1994) and planning 
judgments (Maletta and Kida 1993). In addition, Bedard and Biggs (19912) report that 
auditors generally attempt to use all the cues forming a pattern diagnostic of an overhead 
application error; however, they have difficulty generating correct hypotheses about the 
cause of the misstatement. 

When patterns of information reveal misstatement, auditors risk issuing incorrect audit 
opinions unless they recognize the components of such patterns. When a multi cue pattern 
exists that is diagnostic of a material misstatement, auditors may receive none, some, or 
all of the cues. Previous research (e.g., Brown and Solomon 1990, 1991) that investigated 
auditors' ability to configurally process information manipulated the level of the cues re- 
ceived for a complete pattern (e.g., sales increased or sales decreased, a control was known 
to be present or absent). In contrast, I investigate how auditors respond to partial patterns 
of cues. If industry-specialist auditors possess superior knowledge about the industry in 
which they specialize and can use this knowledge to elaborate about the conditions under 
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which missing information would be diagnostic of misstatement, they should be likely to 
identify a misstatement from a partial pattern. 

Since auditors likely receive incomplete patterns, it is important that they can assess 
the related potential misstatement implications. To date, the literature does not address 
whether auditors are able to process patterns when the presence or absence of cues is 
manipulated (e.g., there is or is not information available about whether a specific control 
is in place to prevent a misstatement that could occur given the presence of another cue, 
and this information would be diagnostic about the risk of misstatement). 

The question about this skill is especially important given the multi-person environment 
in which auditors work. This environment makes it likely that individual audit-team mem- 
bers receive partial-cue patterns. While the team collectively may receive all of the cues, 
no single team member may possess the full pattern. The team member receiving each cue 
must recognize its potential importance or communicate it to other team members in order 
to detect the misstatement. Alternatively, a single auditor ultimately may collect all the cues 
forming a pattern; identification of the pattern will be delayed until all the cues are collected 
if the auditor does not recognize the incomplete pattern. 

Even if recognized, cues that comprise the pattern may go uncommunicated. Several 
features of the audit environment suggest that auditors are unlikely to communicate or share 
the information only they hold (i.e., unique information) with other team members, consis- 
tent with findings in psychology (Stasser and Titus 1985). First, auditors work in very high 
information-load conditions and such conditions reduce the likelihood that unique infor- 
mation is shared with other teammates (i.e., information sharing) (Stasser and Titus 1987). 
Second, larger auditees require larger audit teams and increases in group size reduce in- 
formation sharing (Stasser et al. 1989). Finally, many audit tasks are “judgment” tasks for 
which a demonstrably correct answer does not exist (Kennedy et al. 1997). Stasser and 
Stewart (1992) report that when participants believe that their task has a demonstrably 
correct answer, they share more unique information than when they believe their task is a 
judgment task. Since conditions in the audit environment likely lead to low levels of unique 
information sharing, identification of partial patterns suggestive of misstatement is very 
important. 


Industry-Specialists’ Knowledge 

Industry specialists are those individuals so designated by their firms and whose training 
and experience are mainly in a particular industry. Some. recent research has investigated 
performance by industry specialists and auditors with high recent experience in a domain. 
For example, Owhoso et al. (2002) report that matched industry-specialist auditors detect 
more errors during the review process than do mismatched specialists. Taylor (2000) reports 
that banking industry-specialist auditors’ inherent risk assessments differ (do not differ) 
from those of nonspecialists when evaluating loans receivable (property, plant, and equip- 
ment); he does not investigate whether these differences lead to differences in planned 
procedures. Bedard and Wright (1994) report that high-domain-experience auditors: are 
more likely to react to current risk factors when planning audit procedures than are low- 
domain-experience auditors. Additionally, auditors with high-domain-experience generate 
more plausible hypotheses of likely errors than do auditors with low-domain-experience 
(Wright and Bedard 2000; Wright and Wright 1997). | 

However, matched specialists do not uniformly outperform mismatched specialists. 
Bedard and Biggs (1991b) report that high-domain-experience auditors who receive a cor- 
rect management explanation are more likely than low-domain-experience auditors to rec- 
ognize it as the correct hypothesis to explain a pattern. However, there is no difference 
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between high- and -low-domain-experience auditors when the management explanation is 
incomplete or missing, as is the case in the current study, suggesting that the benefits of 
specialization depend on the features of the situation. Finally, in the only study to explicitly 
investigate specialists’ knowledge to date, Solomon et al. (1999) report that matched in- 
dustry-specialist auditors know more non-errors and non-error frequencies than do mis- 
matched industry specialists; however, results for error knowledge are mixed. This surpris- 
ing result leads to concern that what industry specialists may learn is to explain away 
observed differences as non-errors, rather than recognize that they are errors and potentially 
affect audit effectiveness. This is consistent with the GAO’s concern that industry specialists 
are too close to their clients and cannot objectively evaluate evidence (Carmichael 1997). 

The prior literature suggests that industry experience is helpful in improving audit 
performance under some circumstances. I contend that use of industry specialists will im- 
prove audit quality when industry-specific knowledge is necessary to identify a misstate- 
ment and when knowledge of general accounting and audit testing, and thinking by analogy 
are not adequate substitutes for this knowledge. I contend that one situation for which such 
industry-specialist knowledge is necessary is misstatement identification when a partial error 
pattern is present. Specialists’ cognitive representations may allow them to draw inferences 
from partial patterns of cues that are diagnostic of misstatement in their area of expertise. 

Problem representations are cognitive representations of complex phenomena that 
people use to “тип” thought experiments that allow inferences to be made (Greeno 
1989). Problem representations are a function of external cues currently attended to and 
knowledge stored in memory (Brewer 1987). Problem representations develop naturally 
with experience in a domain and contain the causal features of the domain that they rep- 
resent (Norman 1983; Rumelhart and Norman 1988). Domain knowledge plays an important 
role in the development of problem representations (Fincher-Kiefer et al. 1988; Spilich et 
al. 1979; Dutke 1986). Knowledge enables people to interpret information, allowing high- 
knowledge people to provide coherence and more extensive and accurate interpretations 
than low-knowledge people (Fincher-Kiefer et al. 1988; Voss et al. 1980). Additionally, 
high-knowledge people develop problem representations that are more elaborated that low- 
knowledge people (Tardieu et al. 1992). 

In auditing, Christ (1993) investigated problem representation development in auditors 
of different ranks by examining free recalls. Her case did not contain a seeded misstatement 
or other manipulation of content that would cause an expected shift in focus of her partic- 
ipants. However, she reports that more experienced auditors’ free recalls contain more 
abstractions (1.е., deeper principles) than less experienced auditors. I build on Christ's 
(1993) work in that I expect participants will focus on the seeded misstatements in my 
cases. I expect matched specialists to better interpret information about their industry than 
mismatched specialists when industry-specific knowledge, such as knowledge of unique 
accounting or reporting requirements or business practices, is required. Therefore, they 
should create better-developed problem representations focused on a seeded misstatement 
than do mismatched auditors. Bierstaker et al. (1999) report that problem representations 
that are focused on productive areas are critical to identifying seeded errors in analytical 
review. 

Evidence about the existence and completeness of problem representations can be found 
in explanations and surprise recalls. People base their explanations, judgments, and recall 
of target events on their problem representation of the event. Problem representations con- 
tain the substance of the facts as well as inferences people make and believe to be part of 
the original event (Johnson-Laird 1983). Similarly, matched specialist auditors with knowl- 
edge of industry accounting practices and conditions will likely integrate case information 
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related to a possible misstatement and consequently recall information focused on the 
seeded misstatement area. Additionally, matched specialists will elaborate about the impli- 
cations of the information in the pattern and will recall more relationships between facts 
contained in the case and inferences about implications of the facts in the case (Christ 
1993). Matched specialists’ elaborations about the case will also result in explanacions about 
what (if anything) could be wrong that are focused on the misstatement area ani diagnose 
the seeded misstatement. 

Therefore, I expect matched specialists who receive full- and partial-cue patterns to be 
able to understand the relationships among the cues. Recipients of partial-cue patterns are 
expected to be able to provide coherence to the incomplete pattern. This should lead partial- 
cue pattern recipients to elaborate about the incomplete pattern and hypothesize about the 
missing information. They may even infer the existence of the missing piece of the cue 
pattern. Matched specialists who receive none of the cue pattern are not expected to elab- 
orate about the unstated error pattern. As a result, matched specialists who гесегуе partial- 
cue patterns are expected to have better-developed problem representations about the seeded 
misstatement than will matched specialists who receive no-cue patterns. 

Matched specialists who receive complete cue patterns are also expected to elaborate 
about the cue pattern and the error it suggests. As a result, matched specialists who receive 
complete cue patterns will have better developed problem representations about Ше seeded 
misstatement than will those who receive no-cue patterns; matched specialists who receive 
full-cue patterns will have problem representations that are at least as well developed as 
matched specialists who receive partial-cue patterns. Figure 1 depicts my predictions about 
problem representation development. More formally, I predict (all hypotheses are stated in 
alternative form): 


Hla: Matched specialists receiving partial- and full-cue patterns will have better de- 
veloped problem representations about the seeded misstatement than will matched 
specialists receiving no-cue patterns. 


Mismatched specialists who receive partial-cue patterns are not expected to be able to 
fully understand the implications of the incomplete information. They are not expected to 
elaborate on the incomplete pattern because they are not expected to recognize its signifi- 
cance. Their problem representations are not expected to differ in level of development 
from mismatched specialists who receive no-cue patterns. Mismatched specialis-s who re- 
ceive full-cue patterns may be able to interpret the patterns because they may have enough 
information, combined with their general accounting knowledge, to deduce the misstate- 
ment. Consequently, if they recognize the significance of the pattern, then they are expected 
to develop problem representations that are better developed than mismatched specialists 
who receive the partial-cue pattern. If they are unable to recognize the significance of the 
pattern because only industry-specific knowledge is sufficient to identify the pa-tern, then 
their problem representations will be at least as well developed as those of maiched spe- 
cialists who receive partial-cue patterns. More formally, I predict: 


Hib: Mismatched specialists receiving full-cue patterns will have problem represen- 
tations about the seeded misstatement that are at least as well developed as those 
of mismatched specialists who receive no- or partial-cue-patterns. 
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FIGURE 1 
Hypothesis 1 


Hia contrasts: A < (B + Су2 
Hib contrasts: (D + Буг = Е 
Hic contrasts: B > E 
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Industry match is operationalized at two levels within participants. Matched participants are those working in 
their industry of specialization; mismatched participants are those working outside their industry of 
specialization. 

The cue-pattern condition is cperationalized at three levels between participants. Participants in the None 
condition received no cues fram the seeded misstatement pattern. Participants in the Partial (Full) condition 
received two (all three) cues from the seeded misstatement pattern. 

The dependent variable is the sum of four standardized measures of the completeness of the participants’ 
problem representations about the seeded misstatement. A higher number indicates better developed problem 


representations. 


The above discussion suggests that the key difference between matched and mismatched 
specialists will occur when they receive partial-cue patterns. Matched specialists who re- 
ceive partial-cue patterns are expected to understand the relationship among the cues and 
provide coherence to incomplete patterns resulting in well-developed problem representa- 
tions about the seeded misstatement. Mismatched specialists who receive partial-cue pat- 
terns are not expected to understand these relationships, resulting in relatively impoverished 
problem representations about the seeded misstatement. More formally, I predict: 


Ніс: Matched specialists receiving partial-cue patterns will have better developed prob- 
lem representations about the seeded misstatement than will mismatched special- 
ists receiving partial-cue patterns. 
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Likelihood Assessments 

Matched specialists likely possess knowledge of the patterns of business-operations 
information that would be diagnostic of misstatement, such as departures from expectations 
about "normal" conditions. Knowledge of these patterns may be acquired directly by ex- 
periencing the pattern on audits or indirectly from training sessions or discussions with 
colleagues. Auditors with this knowledge would have the cues that comprise the pattern 
accessibly stored in memory with connections developed between the cues that compose 
the pattern. Therefore, even if specialists receive onlv some of the cues comprising the 
pattern, it is likely that they would recognize the elevated potential for misstatement. Ad- 
ditionally, auditors with well-developed problem representations likely will elaborate about 
the conditions under which the information heightens the risk of material misstatement, 
even if they have not previously encountered the misstatement. 

Matched specialists who receive partial- and full-cue patterns suggestive cf misstate- 
ment will assess the likelihood of misstatement conditioned on two factors: the perceived 
diagnosticity of the partial-cue pattern about error and the conditional likelihood of the 
error(s) they are concerned about relative to other possible non-error conditions given the 
partial pattern. Matched specialists who receive no-cue patterns suggestive of misstatement 
are not expected to perceive the unstated error risk. Therefore, I expect that matched spe- 
cialists who receive partial-cue patterns will assess the likelihood of misstatement higher 
than matched specialists who receive no-cue patterns. 

Matched specialists who receive full-cue patterns will assess the likelihood of misstate- 
ment higher than those who receive no-cue patterns; however, their likelihood assessments 
relative to those in the partial-cue pattern condition will depend on the relative diagnosticity 
of the partial- and full-cue patterns. When the full pattern is much more diagnostic of 
misstatement than is the partial pattern, those receiving the full pattern will produce like- 
lihood assessments that are higher than those receiving partial patterns. When the full 
pattern is only marginally more diagnostic of misstatement than is the partial pattern, those 
receiving full patterns will assess the likelihood of misstatement at least as high as those 
who receive partial-cue patterns. Overall I expect auditors' likelihood assessmerts to be of 
the form depicted in Figure 2 and as stated more formally below. 


H2a: Matched specialists receiving partial- and full-cue patterns will assess higher like- 
lihoods of misstatement than will matched specialists receiving no-cue patterns. 


Mismatched specialists who receive partial-cue patterns are also expected to assess the 
likelihood of misstatement conditioned on the perceived diagnosticity of the partial-cue 
pattern about error and the conditional likelihood of the error(s) they are concerned about 
relative to other possible non-error conditions given the partial pattern. However, because 
they are expected to develop relatively impoverished problem representations, mismatched 
specialists who receive partial-cue patterns are not likely to perceive the diagrosticity of 
the partial-cue pattern and, thus, are not expected to increase their assessment. of risk of 
material misstatement. Their likelihood assessments are not expected to differ “rom those 
of mismatched specialists who receive no-cue patterns. Mismatched specialists who receive 
full-cue patterns may perceive the diagnosticity of the pattern for misstatement and if they 
do, they are expected to increase their assessed likelihood of misstatement accordingly. If 
they are not able to recognize the diagnosticity of the pattern for misstatement, their like- 
lihood assessments are not expected to differ from those of mismatched specialists who 
receive partial-cue patterns. More formally, for mismatched specialists I predict 
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FIGURE 2 
Hypotheses 2 and 4 


Н2а and Нада contrasts: А < (В + С) 
H2b and H4b contrasts: (D + EV2 = Е 
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See Figure 1 for explanations of industry match and cue-pattern variables. 

The dependent variable for H2 is the likelihood that a material misstatement was contained in the financial 
statements. Participants rate this likelihood on a 101-point scale anchored by 0 (No chance of material 
misstatement) and 100 (Certain of material misstatement). I present two graphs as theory does not specify the 
relative positions of the lines for matched and mismatched participants for this dependent variable. 

The dependent variable for НЕ is the total time allocated to procedures coded as discriminatory. Participants 
listed procedures they believed were necessary to determine whether the potential misstatement(s) they had listed 
existed and time necessary to complete those procedures. 


H2b: Mismatched specialists receiving full-cue patterns will assess the likelihood of 
misstatement at least as high as mismatched specialists receiving no- or partial- 
cue patterns. 


The preceding discussion specifies predictions about likelihood assessments of mis- 
matched specialists relative to other mismatched specialists. I make no predictions about 
likelihood assessments of matched versus mismatched specialists as this comparison is not 
clear ex ante. For example, one response to the uncertainty associated with being mis- 
matched may be to assess the likelihood of misstatement higher overall. 

Above I examined the predicted impact of pattern completeness on likelihood assess- 
ments. The discussion hypothesized that pattern completeness will affect likelihood 
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assessments in an indirect manner through the effect of pattern completeness on specialists’ 
problem representation development. I expect that problem representation development will 
directly affect specialists’ likelihood assessments. More formally, I predict: 


H3: Matched specialists’ problem representations about the seeded misstatement are 
expected to mediate the relationship between the pattern completeness manipula- 
tion and likelihood assessments. 


Audit Procedures 


As discussed earlier, the receipt of cues that are diagnostic of misstatement is expected 
to affect industry specialists’ assessed likelihood of misstatement. Industry specialists who 
suspect a misstatement pattern likely seek information that can be used to determine 
whether the full pattern exists and the extent of any misstatement. Specialists who believe 
that there is a relatively higher risk of misstatement are expected to know what industry- 
specific conditions would give rise to a misstatement, what tests should be performed, and 
what evidence should be gathered to determine the existence and extent of misstatement. 

More experienced auditors perform more directed searches for information and choose 
procedures that yield evidence that better discriminates among possible causes of unex- 
pected differences than do less experienced auditors (Biggs et al. 1988). Similarly, I expect 
specialists who are more concerned about a particular misstatement to propose a more 
directed search for evidence about that misstatement (i.e., to seek information wi-h a greater 
ability to discriminate between the hypothesized error and a non-error). More audit time, 
therefore, should be allocated to evidence with higher discriminatory power. 

Matched specialists who receive partial-cue patterns will elaborate about the incomplete 
pattern, hypothesize about the missing information, and conduct more directed searches for 
evidence than will matched specialists who receive no-cue patterns. Consequently, I expect 
these specialists to allocate more time to procedures that will discriminate whether the 
seeded misstatement is present than will matched specialists who receive по-сиз patterns. 

Matched specialists who receive complete cue patterns are also expected to elaborate 
about the cue pattern and the error it suggests. As a result, matched specialists who receive 
complete cue patterns will allocate more time to procedures that will discriminete whether 
the seeded misstatement is present than those who receive no-cue patterns; however, the 
amount of time allocated relative to those in the partial-cue pattern condition will depend 
on the relative diagnosticity of the partial- and full-cue patterns. When the full pattern is 
much more diagnostic of misstatement than is the partial pattern, those receiving full pat- 
terns will allocate more time to discriminatory procedures than those receiving partial 
patterns. However, when the full pattern is only marginally more diagnostic of misstatement 
than is the partial pattern, those receiving full patterns will allocate at least as much time 
to discriminatory procedures as will those receiving partial patterns. Figure 2 depicts my 
predictions about time allocated to discriminatory procedures. More formally, I predict: 


Нада: Matched specialists receiving full- and partial-cue patterns will allocate more time 
to procedures that will discriminate whether the seeded misstatement is present 
than will matched specialists receiving no-cue patterns. 


5 Ido not make a similar mediation prediction for mismatched specialists as I do not expect there will be sufficient 
variation in mismatched specialists’ problem representation development or likelihood assessments to allow 
detection of such a relationship. 
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Mismatched specialists who receive partial-cue patterns are not expected to be able to 
interpret the pattern, nor are they expected to understand the industry-specific conditions 
that would give rise to the misstatement. As such, they are not expected to have a particular 
misstatement in mind about which they will seek evidence. Therefore, mismatched spe- 
cialists who receive partial-cue patterns are not expected to differentially allocate time to 
procedures that will discriminate whether the target misstatement is present compared to 
mismatched specialists who receive no-cue patterns. Mismatched specialists who receive 
full-cue patterns may be able to identify the potential misstatement and may be able to 
conduct directed searches for evidence if their general knowledge of audit testing allows 
them to design tests that would discriminate the misstatement. If only industry-specific 
knowledge is sufficient to design tests that will discriminate the presence of the seeded 
misstatement, then they will allocate at least as much time to discriminatory procedures as 
mismatched specialists who receive partial-cue patterns. Stated more formally, these pre- 
dictions are: 


НАФ: Mismatched specialists receiving full-cue patterns will allocate at least as much 
time to procedures that will discriminate whether the seeded misstatement is 
present as will mismatched specialists receiving no- or partial-cue patterns. 


Specialists who believe that there is a relatively low risk of misstatement are unlikely 
to have specific misstatements in mind. Consequently, they will be relatively unlikely to 
identify audit procedures that would provide evidence about the presence of a particular 
misstatement. Less experienced auditors generally perform ill-defined searches and choose 
audit procedures that poorly discriminate among possible causes of audit fluctuations (Biggs 
et al. 1988). Consequently, I expect mismatched auditors to allocate more time to audit 
procedures with relatively low ability to discriminate whether a misstatement is present 
than do matched specialists. Stated more formally, this prediction is: 


H5: Mismatched specialists will allocate more time to procedures unlikely to discrim- 
inate the presence of misstatement than will matched specialists. 


Ш. RESEARCH METHOD 
Participants 
Participants volunteered after being identified as firm-designated industry-specialist au- 
ditors in banking or state and local governments. Seventy-four industry-specialist auditors 
from four Big 5 firms participated in one of 19 sessions of the experiment;*® responses from 
65 auditors are reported herein. As summarized in Table 1, participants’ mean audit ex- 
perience is 83.82 months. Participants are partners/principals (n = 10), senior managers (n 


5 І collected data from 44 participants pre-Enron (35 banking specialists and 9 government specialists) and 21 
government specialists post-Enron. Because of the potential effect on the data I performed careful analysis to 
ensure that there were no systematic differences across time periods. I estimated all of the models including 
timing of data collection (pre- or post-Enron) as a factor nested within specialty. The timing of data collection 
was not significant in any of the models for any dependent variable and the timing did not interact with the 
other independent variables in the models. Additionally, I re-estimated the contrasts used to test the hypotheses 
and all statistical inferences hold when timing of data collection is included in the models. Consequently, I 
conclude that timing of data collection did not impact the dependent variables and it is appropriate to exclude 
timing from the models and report the more parsimonious models. 

7 JT exclude nine participants because they reported either significant experience in both industries (n = 2) or no 
significant experience in either industry (n — 7). 
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TABLE 1 
Participant Demographics 
All Banking Government 
Participants Specialists Specialists 
(n = 65) (n-35 (0 30) 
Months of audit experience [Mean (Std. Dev.)] 83.82 71.46 98.23 
(69.87) (62.40) ` (76.22) 
Title 
Seniors 30 19 11 
Managers 14 9 5 
Senior Managers 11 2 9 
Partners /Principals 10 5 5 
Months experience auditing 
Banks 49.09 4.87 
(52.75) (15.62) 
State and local governments 0.57 72.67 
(2.16) (78.94) 
Percent of time in past year auditing 
Banks 63.46% 0.03% 
(29.79) (0.18) 
State and local governments 0.43% 56.60% 
(2.54) (29.63) 





= 11), managers (п = 14), and seniors (п = 30).8 Banking (government) specizlists report 
having an average of 49.09 (4.87) months of experience auditing banking сЇїеп:5 and 0.57 
(72.67) months of experience auditing government clients. Banking (government) special- 
ists report spending an average of 63.46 percent (0.03 percent) of their time in the past 
year auditing banking clients and 0.43 percent (56.60 percent) of their time in the past year 
auditing government clients. 


Task 


The experimental instrument contains two cases, one bank and one city-government 
case, developed in consultation with audit partners who specialize in each indusiry. I chose 
these industries to minimize expected knowledge transfer, thus providing a strong test of 
the effects of industry-knowledge differences on performance. These industries have unique 
accounting issues and the industries are not included in the same industry category by any 
of the Big 5 accounting firms. For each case participants were asked to: read the case, 
assess the likelihood of material misstatement, report what (if any) misstatement they were 
concerned about, list procedures they would perform and audit hours necessary tc determine 
whether their noted misstatements were present, and recall all important information from 
each case. After completing the second case, participants answered a post-experimental 
questionnaire that contained manipulation checks and demographic questions. 


* The participating firms designate qualifying auditors as industry specialists beginning at the senior level. I reran 
all the analysis for H1, H2, H4, and H5 including total months of audit experience as a covariate. Total months 
of audit experience was not significant in any of the models as a covariate and it did not interact with any of 
the other variables in the models. Additionally, all inferences hold when the variable is included. This analysis 
provides evidence that the results are not driven by the more experienced participants but can be attributed to 
auditor specialization. 
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Participants received unaudited revenue numbers and background information about 
two revenue items that comprised the area under audit in each case. The cases are presented 
in Appendices A and B. Each case contained a seeded material misstatement of one of the 
two revenue items discussed. The revenue recognition problem of interest was unique to, 
but common within, the target industry. This ensured that industry knowledge, rather than 
general business or accounting knowledge, was necessary to identify the risk of misstate- 
ment.’ A series ої manipulation-check questions confirm that matched specialists knew the 
appropriate accounting for the different types of revenue and that mismatched specialists 
were not able to reason by analogy to determine the correct accounting for the target 
revenue areas. In the post-experimental questionnaire, participants were asked to recognize 
the appropriate accounting method for each of the five types of revenue that appear in the 
cases (i.e., successful loan application fees, unsuccessful loan application fees, credit card 
fees, grant revenue, and sewer taxes). Participants answered on an 11-point scale anchored 
' by 0 (certain does not comply with GAAP) and 10 (certain does comply with GAAP). The 
target revenues in the case were the successful loan application fees and the grant revenue. 
These items required knowledge of an industry-specific accounting method and of the 
characteristics that allow one to determine the correct application of the method.'? I ex- 
pected this knowledge would not easily be transferred from another domain or solved by 
analogy. The non-target revenue types, although still specific to the industry, did not require 
an industry-specific accounting method. I expected that participants, using their general 
accounting knowledge, would be able to reason by analogy and be confident that they knew 
how to account for the non-target revenues. 

. The data support these expectations. Government specialists rated the appropriate ac- 
counting method for the grant revenue higher (7.80) than did the banking specialists (3.47) 
(p = 0.00); banking specialists rated the appropriate accounting method for the successful 
loan application-fee revenue higher (5.98) than did the government specialists (3.82) (p 
— 0.01). These differences in knowledge of the correct accounting for the target revenue 
items suggest that this knowledge was necessary to allow matched specialists to develop 
more complete problem representations about the target misstatements. Additionally, it pro- 
vides evidence that mismatched specialists were not able to transfer knowledge or reason 
by analogy to determine the correct accounting treatments for the target revenue areas. 
There were no significant differences between the two groups of specialists on the other 
three revenue measures.!! This suggests that mismatched specialists: were able to transfer 
some knowledge or reason by analogy and apply it to these non-target revenues. 

Participants received the experimental materials in a set of six numbered envelopes. 
They were instructed to work at their own pace, to return each completed document to its 
envelope, and to close it before opening the next envelope. A proctor ensured participants 
complied with instructions during all sessions of the experiment. On average, participants 
completed the materials in about 75 minutes. 


9 Matched specialists had previously encountered the seeded errors 16.9 percent of the time; mismatched spe- 
cialists had encountered the seeded errors only 1. 5 percent of the time. This difference is significant (p « 0.01), 
confirming that matched specialists had more experience with the seeded errors than did mismatched specialists. 

1? For the banking case, participants had to determine, from the characteristics given, whether the loan fees were 
direct; for the government case, participants had to determine, from the characteristics given, whether the grant 
was reimbursement driven. 

!! Government specialists did not rate the appropriate accounting method for the sewer taxes higher (6.06) than : 
did the banking specialists (5.19) (p — 0.32). Banking specialists did not rate the appropriate accounting method 
for the unsuccessful loan application-fee revenue higher (8.71) than did the government specialists (8.05) (p 
= 0.22); banking specialists did not rate the appropriate accounting method for credit card fees higher (8.71) 
than did the government specialists (8.09) (p — 0.26). 
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Independent Variables 


There are two independent variables: industry match and completeness of the misstate- 
ment cue pattern.'? I create the two levels of industry match (i.e., matched and mismatched) 
by manipulating context and measuring participant industry specialization. I manipulate 
context within-participants at two levels. Each participant evaluates one banking case and 
one city-government case.!? I measure participants’ industry of specialization; participants 
specialize in auditing banks or state and local governments. I manipulate completeness of 
the misstatement cue pattern between-participants. The misstatement pattern for each case 
consists of three cues whose presence or absence in the case is manipulated at three levels 
(i.e., no, zero cues present; partial, two cues present; and full, all three cues present). The 
pattern cues must be combined with the case background information to determine whether 
a misstatement exists. The background information, on its own, provides no reason for 
participants to suspect the misstatement. The partial pattern for each case provides infor- 
mation that allows participants to start forming an expectation about the amount of revenue 
that should be recognized and to determine that under certain circumstances there is a 
particular misstatement. However, they must receive the information in the іс cue to be 
certain of the misstatement. The third cue for each case contains information abont a client- 
specific accounting convention. This information, combined with participant knowledge 
about how the transaction should be accounted for, allows participants to determine the 
correct misstatement. 


Dependent Variables 


Three classes of dependent variables were measured: problem representation measures, 
likelihood assessments, and total time allocated to procedures that will or will not discrim- 
inate the misstatement. I used four different problem representation measures to create the 
overall measures used to test H1 and H3: recall content, recall character, explanation con- 
tent, and explanation direction. Higher scores are consistent with better-developed problem 
representations. 

І computed the recall content and character measures from participants’ listing of all 
the important information they could remember from the case. Participants were instructed 
not to look at the case materials while performing this task. I parsed the recalls into idea 
units, and a doctoral student with auditing experience and I worked independently to code 
the recalls and all other items for the project. The doctoral-student coder was blind to 
participant specialty and the hypotheses for all coding for this project. 

We coded recall idea units on two drmensions: content and character. Conteat of each 
recall idea unit was coded as relating to the misstatement area or relating to other areas of 
the case. Items coded as related to the misstatement area were further coded zs to their 
character; each idea unit was coded as expressing either case facts or relationships and 
inferences from the case. Participants who were better able to interpret the case scenario 


12 I also manipulate the order of the cases; half of the subjects receive the banking case first and hal? receive Ше 
government case first. Except as noted there are no predicted or observed order effects, so I combine all 
subsequent analysis across order conditions. 

13 Т also analyzed the results by case and the results are qualitatively the same as those reported in the paper. The 
predictions, in general, hold for each hypothesis for each case, however, there is reduced significance in some 
instances due to the reduction in sample size and resulting statistical power. I had no ex ante predictions or 
theoretical reason to investigate differences due to a specific industry, so I report the results combined over the 
two industries to take advantage of the experimental design and rule out certain alternative explanations for my 
results (e.g., that something idiosyncratic about one case or specialty is driving the average results, or that 
specialists in one industry have higher or lower ability and this drives the everage result). 
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as indicating misstatement were expected to recall more items related to the misstatement 
area. These participants also were expected to make connections among facts presented and 
include more relationships and inferences related to the misstatement area in their recalls. 
Inter-rater agreement is 97.9 percent for recall content and 89.3 percent for character. 
Cohen’s Kappa, a measure of inter-rater agreement over and above that expected by random 
agreement, is 0.969 and 0.626 for content and character, respectively; both are significantly 
different from zero (p < 0.001). All coding differences for the project were mutually 
resolved. 

I computed the explanation content and direction measures from participants’ expla- 
nations of the misstatement(s), if any, about which they were most concerned. We coded 
these responses on two dimensions: content and direction. The content of each explanation 
idea unit was coded as relating to the misstatement area or other areas of the case. The 
direction of each explanation idea unit was coded as supporting or not supporting the seeded 
misstatement about the target revenue area. Participants who were better able to interpret 
the case scenario and who focused on the seeded misstatement were expected to explain 
more items that relate to the target revenue area. The participants also were expected to 
include more items that explain the seeded misstatement. Inter-rater agreement is 90.6 
percent for explanation content and 79.3 percent for direction. Cohen’s Kappa is 0.857 and 
0.683 for content and direction, respectively; both are significantly different from zero (p 
< 0.001). 

After reading each case and before listing the recalls, participants rated the likelihood 
that a material misstatement was contained in the financial statements. The likelihood as- 
sessment is the dependent variable for H2. Participants were free to look back at the case 
materials while making this assessment. The likelihood assessment is elicited on a 101- 
point scale anchored by 0 (no chance of material misstatement) and 100 (certain of material 
misstatement). : 

Finally, participants listed procedures and specified the audit time necessary to deter- 
mine whether the potential misstatements they had previously identified were present. We 
coded these procedures on the ability to discriminate the target misstatement. The total time 
allocated to discriminatory procedures is the dependent measure for H4; total time allocated 
to nondiscriminatory procedures is the dependent measure for H5. Procedures were coded 
as discriminatory if they would provide direct evidence about whether the target misstate- 
ment is present; procedures were coded as nondiscriminatory if they would not provide 
such evidence or if they would provide evidence about the non-target revenue area. Inter- 
rater agreement is 79.5 percent; Cohen's Kappa is 0.514 and is significantly different from 
zero (p « 0.001). 


IV. RESULTS 

Hypothesis 1 

In Hla, I predict that matched participants’ problem representations about the seeded 
misstatement will be better developed when they receive full- and partial-cue patterns than 
when they receive no-cue patterns. In Hib, I predict that mismatched participants’ problem 
representations about the seeded misstatement will be at least as well developed when they 
receive full-cue patterns as when they receive no- or partial-cue patterns. Finally in Hlc, I 
predict that matched specialists receiving partial-cue patterns will have better-developed 
problem representations about the seeded misstatement than will mismatched specialists 
receiving partial-cue patterns. The hypotheses predict specific patterns of cell means, so I 
use contrasts as the primary tests of the hypotheses (Buckless and Ravenscroft 1990). 
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I estimate a repeated-measures ANOVA with the overall problem representation mea- 
sures as dependent variables and industry match (within participant) and cue pattern con- 
dition as independent variables. I present the means and the results of the planned contrasts 
in Table 2, Panels A and B, respectively. Matched participants who receive full- or partial- 
cue patterns have better developed problem representations for the seeded misstatement 
(mean 1.26) than do matched participants in the no-cue pattern condition (—0.51) and this 
difference is significant (Е, сг = 7.545, one-sided p = 0.004), supporting Hla, Additionally, 
a post hoc test reveals that the problem representation development of participants in the 


TABLE 2 
Problem Representation Measures 
LS Mean 


Hla contrast: А < (B + C)/2 
Hib contrast: (D + E)/2 = Е 
Hic contrast: B > E 


Panel A: Problem Representation Measures 


Cue Pattern 
No Partial 
п = 21 = 23 
Matched Recall Content —0.09 0.13 
Recall Character 0.20 0.15 
Explanation Content —0.44 0.01 
Explanation Direction —0.17 0.59 
Overall Measure —0.51 0.89 
(SE) (0.53) (0.51) 
Cell A B 
Mismatched Recall Content —0.46 —0.19 
Recall Character —0.14 —0.16 
Explanation Content —0.17 —0.09 
Explanation Direction -0.62 -0.37 
Overall Measure —1.39 —0.80 
(SE) (0.45) (0.43) 
Cell D E 
Panel B: Tests of H1 
Planned Contrasts Е а 
Matched Мо < (Matched Partial + Full)/2 7.545 
(Mismatched No + Partial)/2 = Mismatched Full 1.293 
Matched Partial > Mismatched Partial 7.856 


Industry match is operationalized at two levels within participants. Matched participants are those working in 


their industry of specialization; mismatched participants are those working outside their industry of 


specialization. 


The cue-pattern condition is operationalized at three levels between participants. Participants in the No Cue 
Pattern condition received no cues from the seeded misstatement pattern. Participants in the Partial- (Full-Cue) 


Pattern condition received two (all three) cues from the seeded misstatement pattern. 


The dependent variable is computed by standardizing the four problem representation measures (Recall content, 
Recall character, Explanation content, and Explanation direction) using Z-scores and summing to compute 


overall problem representation measures. 
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partial-cue pattern condition (0.89) is greater than that of those in the no-cue pattern con- 
dition (Е, с = 3.622, one-sided р = 0.031). Mismatched participants who receive full-cue 
patterns have problem representations for the seeded misstatement (—0.48) that are at least 
as well developed as those of mismatched participants in the no- or partial-cue pattern 
conditions (—1.10) (F, „„ = 1.293, one-sided р = 0.130), consistent with H1b. Finally, 
matched participants in the partial pattern condition have better developed problem repre- 
sentations for the seeded misstatement (0.89) than do mismatched participants in the partial 
pattern condition (—0.80) (Е, „ = 7.856, one-sided р = 0.004), supporting Hic. 


Recall and Explanation Problem Representation Measures 

I also investigate whether the overall problem representation results are consistent 
across its components by disaggregating the overall measure into recall and explanation 
problem representation measures. I compute a total recall problem representation measure 
by summing the recall content and character measures. I compute a total explanation prob- 
lem representation measure by summing the explanation content and direction measures. I 
estimate a repeated-measures ANOVA with the total recall and total explanation problem 
representation measures as dependent variables and industry match (within participant) and 
cue pattern condition as independent variables. Matched participants who receive full- or 
partial-cue patterns have more developed recall problem representation measures related to 
the seeded misstatement (mean 0.50) than do matched participants in the no-cue pattern 
condition (0.11) but this difference is not significant (Е, „ = 0.707, one-sided р = 0.202). 
The pattern of means is consistent with the hypothesis for the recall content dimension of 
this measure. The insignificant result appears to be driven by the character dimension 
of this measure; matched specialists appear to generate inferences and relationships related 
to the misstatement area even in the no-cue pattern condition. 

Matched participants who receive full- or partial-cue patterns have more developed 
explanation problem representation measures related to the seeded misstatement (mean 
0.76) than do matched participants in the no-cue pattern condition (—0.61) and this differ- 
ence is significant (Е, в = 10.431, one-sided р = 0.001). Additionally, a post hoc test 
reveals that the explanation development of participants in the partial-cue pattern condition 
(0.60) is greater than that of those in the no-cue pattern condition (ЕР, ~ = 6.326, one-sided 
p = 0.008). Overall, this evidence is consistent with Hla; matched specialists have better 
developed problem representations in the partial- and full-cue pattern conditions than in the 
no-cue pattern condition. 

Mismatched participants who receive full-cue patterns have recall problem represen- 
tations related to the seeded misstatement (0.03) that are at least as well developed as those 
of mismatched participants in the no- and partial-cue pattern conditions (mean —0.47) 
(Е, ва = 1.692, one-sided р = 0.099). Mismatched participants who receive full-cue patterns 
have explanation problem representations related to the seeded misstatement (—0.51) that 
are at least as well developed as those of mismatched participants in the no- and partial- 
cue pattern conditions (mean —0.63) (Е, с = 0.141, one-sided р = 0.354). This evidence 
is consistent with H1b. 

Finally, matched participants in the partial-cue pattern condition have better developed 
recall problem representations for the seeded misstatement (0.28) than do mismatched par- 
ticipants in the partial-cue pattern condition (—0.35) and this difference is significant at the 
10 percent level (Е, «; = 2.396, one-sided р = 0.063). Matched participants in the partial- 
cue pattern condition have better developed explanation problem representations (0.60) than 
do mismatched participants in the partial-cue pattern condition (—0.46) and this difference 
is significant (Е, ,, = 6.931, one-sided р = 0.006). This evidence supports H1c; matched 
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specialists have better developed problem representations in the partial-cue pattern condition 
than do mismatched specialists. 


Hypothesis 2 

In H2a, I predict that matched participants will assess the likelihood of m:sstatement 
higher when they receive partial- or full-cue patterns than when they receive no-cue pat- 
terns. In H2b, I predict that mismatched participants' likelihood assessments will be at least 
as high when they receive full-cue patterns as when they receive no- or partial-cue patterns. 

I estimate a repeated-measures ANOVA with matched and mismatched likelihood of 
misstatement as the dependent variables and industry match (within participant) and cue- 
pattern condition (between-participant) as independent variables. Table 3 provid2s descrip- 
tive statistics (Panel А) and planned contrast results (Panel B). Cell means аге zraphed in 
Figure 3. Matched participants’ mean likelihood assessments in the partial- and full-cue 
pattern conditions (66.12) is greater than those in the no-cue pattern condition (43.90) and 
this difference is significant (Е, с, = 6.720, one-sided p = 0.006). Additionally, a post hoc 
test reveals that the mean likelihood assessments in the partial-cue pattern condition (67.44) 
is greater than the mean in the no-cue pattern condition (Е, с, = 5.824, one-sided р 
= 0.008). The mean of the mismatched participants’ likelihood assessments in the full-cue 
pattern condition (54.67) is at least as great as the mean in the no- and partial-cue pattern 
conditions (54.57); however, this difference is not significant (Е, с = 0.000, one-sided 


TABLE 3 
H2: Likelihood Assessments 
LS Mean (SE) [n] Cell 


H2a contrast: A « (B + C)/2 
H2b contrast: (D + Е)/2 = Е 


Panel A: Likelihood Assessments 














Cue Pattern 
No Partial Full 
Matched 43.90 67.44 64.81 
(7.05) (6.74) (7.05) 
[21] [23] [21] 
А B C 
Mismatched 46.14 63.00 54.67 
(6.23) (5.95) (6.23) 
[21] [3] - [21] 
D E F 
Panel B: Tests of H2 
p>F 
Planned Contrasts Fa (one-tailed) 
Matched No < (Matched Partial + Full)/2 6.720 0.006 
(Mismatched No + Partial)/2 = Mismatched Pull 0.000 0.495 


See Table 2 for explanations of industry match and cue-pattern variables. 

The dependent variable is the likelihood that a material misstatement was contained in the financial s-atements. 
Participants rate this likelihood on a 101-point scale anchored by 0 (No chance of material misstatement) and 
100 (Certain of material misstatement). 
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FIGURE 3 
Likelihood Assessment Results 
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See Figure 1 for explanations of industry match and cue-pattern variables. 

The dependent variable is the likelihood that a material misstatement was contained in the financial statements. 
Participants rate this likelihood on a 101-point scale anchored by 0 (No chance of material misstatement) and 
100 (Certain of material misstatement). 





p = 0.495). Contrary to expectations, mismatched participants’ likelihood assessments in 
the partial-cue pattern condition (63.00) are marginally higher than those in the no-cue 
pattern condition (46.14) (Е, ,, = 3.832, two-sided р = 0.055). 

This evidence supports H2a for the matched participants; their likelihood assessments 
are higher in the full- and partial-cue pattern condition than in the no-cue pattern condition. 
However, H2b is not supported. Instead, it appears that mismatched participants may be 
aware that there is increased risk in the partial-cue pattern condition. 


Hypothesis 3 

In H3, I predict that matched specialists’ problem representation development will me- 
diate the relationship between likelihood assessments and the pattern-condition manipula- 
tion; this relationship is not predicted for mismatched specialists. I use mediation analysis 
to investigate this (Baron and Kenny 1986).'* 


Matched Specialists 
I expect problem representation development to mediate the relationship between the 
independent variable, pattern completeness, and the dependent variable, matched likelihood 


14 A mediation analysis requires a relationship to mediate. Generally, the following analysis is required: First, the 
dependent variable of interest (¢.g., likelihood assessment) is shown to be related to the independent variable 
(e.g., pattern completeness). Second, the expected mediator variable (e.g., overall problem representation mea- 
sure) is shown to be related to the independent variable. Third, the mediator variables are shown to be correlated 
with the dependent variable. Finally, the mediator and original independent variables are run against the depen- 
dent variable. If the mediator variable mediates the relationship between the dependent variable and the original 
independent variable, then the significance of the original independent variable will be reduced over the first 
ANOVA and the mediator variable will be significant in this analysis. 
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assessment. The matched likelihood assessment is correlated with the problem representa- 
tion measure (т = 0.354, Bonferroni adjusted р = 0.038). 

An ANOVA (Panel A of Table 4) shows that the matched likelihood assessment is 
related to the independent variable pattern completeness (E, с = 3.417, р = 0.039). Ап 
ANOVA (Panel B of Table 4) shows that the problem representation measure is affected 
by pattern completeness (E; „ = 4.226, р = 0.019). Finally, an ANOVA (Panel С of Table 
4) with matched likelihood assessment as the dependent variable and pattern completeness 
and the problem representation measure as independent variables shows that the relationship 
between the matched likelihood assessment and pattern completeness is mediated by prob- 
lem representation development. In particular, problem representation development is sig- 
nificant (Е, зо = 5.634, р = 0.021) and pattern completeness is no longer significant in this 
ANOVA (Е, 5, = 1.666, р = 0.198). This analysis demonstrates that pattern condition does 
not directly affect likelihood assessments for matched participants; instead, pattern condition 
affects matched likelihood assessments indirectly through its effect on problem represen- 
tation development. This evidence supports H3. 


Mismatched Specialists 

The relationship examined, but not expected to mediate, is between the dependent 
variable mismatched likelihood assessment and the independent variable pattern condition 
and the potential mediator, the problem representation measure. The mismatched likelihood 
assessment is not correlated with the problem representation measure (r = 0.066, Bonferroni 
adjusted p = 1.000). 

An ANOVA (not tabulated) shows that the mismatched likelihood assessment is not 
related to the independent variable pattern completeness (Е, „ = 1.917, р = 0.156). There- 
fore, for the mismatched participants, the relationship examined between the dependent 


TABLE 4 
Matched Participants’ Mediation Analysis 
Sum of  . 
Source Squares df F p>F 
Panel A: Effect of Cue Pattern on MatchLA 
Cue Pattern 7,130.28 2 3.417 0.039 
Error 64,696.70 — 62 


Panel B: Effect of Cue Pattern on MatchPR 


Cue Pattern 50.10 2 4.226 0.019 
Error 367.45 62 

Panel C: Effect of Cue Pattern and MatchPR on MatchLA 

Cue Pattern 3,264.97 2 1.666 0.198 
MatchPR 5,520.90 1 5.634 0.021 
Cue Pattern * MatchPR 1,517.32 2 0.774 0.466 
Error 57,817.11 59 

Variable Definitions: 


MatchLA = matched specialists’ likelihood assessments; and 
MatchPR = matched specialists’ overall problem representation measures. 
See Table 2 for explanation of the cue-pattern variable. 
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variable (likelihood assessments) and the independent variable (pattern completeness) is not 
significant and likelihood assessments are not correlated with the problem representation 
measure, so there is no potential for mediation, as expected.!? 


Hypothesis 4 

In H4a, I predict that matched participants who receive full- and partial-cue patterns 
will allocate more time to procedures that discriminate whether a misstatement is present 
than will matched participants who receive no-cue patterns. In H4b, I predict that mis- 
matched participants who receive full-cue patterns will allocate at least as much time to 
discriminatory procedures as will mismatched participants who receive no- or partial-cue 
patterns. 

I estimate a repeated-measures ANOVA with matched and mismatched time allocated 
to discriminatory procedures as the dependent variable and industry match (within partic- 
ipant), case order, and cue-pattern condition (between-participant) as independent variables, 
and matched and mismatched total time allocated to all procedures as covariates. I present 
the adjusted cell means and results of planned contrasts in Panels A and B of ‘Table 5 
and the adjusted cell means are graphed in Figure 4. Matched participants’ mean time 
allocated to discriminatory procedures in the partial- and full-cue pattern conditions (3.66) 
is greater than that in the no-cue pattern condition (1.69) and this difference is significant 
(Р, 57 = 3.389, one-sided р = 0.036). Additionally, a post hoc test reveals that the mean 
time allocated to discriminatory procedures in the partial-cue pattern condition (3.04) is 
greater than the mean in the no-cue pattern condition, however, this difference is not sig- 
nificant (ЕР, 6 = 1.217, one-sided р = 0.137). Mismatched participants in the full-cue 
pattern condition allocated at least as much time to discriminatory procedures (2.24) as 
those participants in the no- and partial-cue pattern conditions (1.35), (Е, з, = 0.916, one- 
sided p = 0.171). These results support H4a and H4b. 


Hypothesis 5 

In H5, I predict that mismatched participants will allocate more time to procedures that 
likely will not discriminate whether a misstatement is present than will matched participants. 
I estimate a repeated measures ANOVA with matched and mismatched time allocated to 
procedures that likely will not discriminate the presence of the target misstatement as the 
dependent variables and industry match (within participant), case order, and cue-pattern 
condition (between-participant) as independent variables, and matched and mismatched total 
time allocated to all procedures as covariates. Adjusted means and results of planned con- 
trasts are presented in Panels C and D of Table 5. Mismatched participants allocated more 
time to procedures that did not discriminate whether a misstatement is present (8.88) than 
did matched participants (7.51) and this difference is significant (Е, з, = 3.093, one-sided 
р = 0.042). This evidence supports H5. 


Additional Analysis 
I perform additional analysis to ensure that participants performed better in their spe- 
cialty industry due to knowledge differences rather than because they worked harder on the 


15 For completeness, an ANOVA shows that the problem representation measure is not affected by pattern com- 
pleteness (E; є = 1.077, р = 0.347). Finally, an ANOVA with mismatched likelihood assessment as the de- 
pendent variable and pattern completeness and the problem representation measure as independent variables 
shows that the relationship between the mismatched likelihood assessment and pattern completeness is not 
mediated by the problem representation measure; pattern completeness is not significant in this ANOVA (Е, 55 

. = 1.395, р = 0.256) and the problem representation measure also is not significant (Р, s = 0.166, р = 0.686). 
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H4a contrast: A < (B + C)/2 
Нађ contrast: (D + Е)/2 = Е 


TABLE 5 
Procedure Discrimination 


Adjusted LS Mean (SE) [n] Cell 


Panel A: Time Allocated to Discriminatory Procedures 








Cue Pattern 
No : Partial 
Matched 1.69 3.04 
(0.88) (0.84) 
[21] [23] 
А B 
Mismatched 1.06 1.63 
(0.77) (0.74) 
[21] [23] 
D E 
Panel B: Tests of H4 
Planned Contrasts Е, 57 
Matched No < (Matched Partial + Full)/2 3.389 
(Mismatched No + Partial)/2 = Mismatched Full 0.916 
Panel C: Time Allocated to Non-Discriminatory Procedures 
Cue Pattern 
None Partial 
Matched 8.77 7.42 
(0.88) (0.84) 
[21] [23] 
А B 
Mismatched 9.43 8.86 
(0.77) (0.74) 
[21] [23] 
р Е 
Panel D: Tests of H5 
Planned Contrasts Е, = 
Mismatched > Matched 3.093 


See Table 2 for explanations of industry match and cue- pattern variables. 
The dependent variable for H4a and H4b is the total time, in hours, participants allocated to procedures that can 
discriminate whether the target misstatement was present. The dependent variable for H5 is the tota! time, in 
hours, participants allocated to procedures that would not discriminate whether the target misstatement was 


present. 


Hammersley 


case that matched their specialization than on the case outside their specialization. Partic- 
ipants completed ratings of effort and difficulty immediately after the recall task for each 
case. They respond on 11-point Likert scales anchored by 0 (very easy) and 10 (very hard) 
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FIGURE 4 
Discrimination of Procedures Results 
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See Figure 1 for explanations of industry match and cue-pattern variables. 
The dependent variable is the total time participants allocated to procedures that can discriminate whether the 
target misstatement was preseat. 


for the difficulty question and 0 (not at all hard) and 10 (very hard) for the effort question. 
Participants report that the cases mismatched on specialty are more difficult (6.01) than the 
cases matched on specialty (5.27); this difference is significant (Е, єр = 10.501, two-sided 
p = 0.002). They report that they work harder on the mismatched cases (5.78) than on the 
matched cases (4.17); this difference is significant (Е, в = 41.778, two-sided р = 0.000). 

Because participant self-reports of effort are noisy measures of effort (Bettman et al. 
1990), I also compare participant work output (i.e., the number of items recalled for each 
case) as a measure of effort. The number of items recalled does not differ depending on 
whether the participants are matched on industry specialty (Е, „„= 1.521, two-sided p 
— 0.222). This analysis, in combination with the observed systematic differences in the 
problem representation development, provides evidence that knowledge, not effort, explains 
the results. 


V. CONCLUDING REMARKS 

Limitations 

I am aware of two boundary conditions in relation to the theory I test in this paper. 
First, the theory applies to situations in which industry specialists are expected to have a 
comparative advantage over nonspecialists due to knowledge differences. This comparative 
advantage is expected to exist when patterns are composed of industry-specific cues or are 
focused on industry-specific transactions. Industry specialists are unlikely to have as large 
a comparative advantage in situations where there are opportunities for knowledge transfer 
by analogy or direct transfer. Knowledge transfer likely occurs when a situation in one 
industry is easily mapped into a target situation in another industry (Thibodeau 2003). 
Transfer is likely when auditing common “generic” accounts (e.g., property, plant, and 
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equipment) as virtually all firms have such accounts and the audit procedures, accounting 
requirements, and sources of risk are similar across industries. Transfer also is likely in 
similar industries or where sources of risk, methods of accounting, or types of transactions 
are similar. In my study, to insure a powerful test of my theory, I chose industries that have 
little in common and I designed the cases around issues that are unique to these industries 
so that opportunities for knowledge transfer were minimized. 

Second, the theory specifies how industry specialists are expected to react when a 
misstatement is present. It is likely that there are implications for how industry specialists 
react when a misstatement is absent as well; however, those predictions are not developed 
in this paper. This may be a fruitful area for future research. 


Conclusions 

Misstatements that are difficult to diagnose because they are complex or hidden from 
the auditor may be described by a pattern of cues. The division of labor on audits suggests 
the possibility that different auditors will collect the cues that form a pattern. Features of 
the audit ecology, such as the large amount of information to be processed, the types of 
tasks performed, and the size of audit teams make it likely that little sharing of unique 
information about pattern cues will occur. The brainstorming requirements of SAS No. 99 
may improve unique information sharing for auditors in identified fraud risk areas. This is 
an open question that is left to future research. However, it remains important to understand 
how well individual auditors interpret incomplete patterns suggestive of misstatement. 

In this paper, I examine whether industry-specialist auditors use their problem repre- 
sentations to help them interpret incomplete patterns that are descriptive of misstatement. 
Consistent with expectations, matched specialists develop more complete problem repre- 
sentations about a seeded misstatement when they receive partial- or full-cue patterns than 
when they receive no-cue patterns. These problem representations are related to differences 
in matched specialists’ assessments of misstatement likelihood. Additionally, matched spe- 
cialists specify audit procedures that effectively and efficiently discriminate the presence of 
the seeded misstatement. Mismatched specialists do not develop more complete problem 
representations with more pattern cues and their problem representations are not related to 
their assessments of misstatement likelihoods. Additionally, mismatched specialists do not 
suggest audit procedures that either would effectively or efficiently discriminate the pres- 
ence of the seeded misstatement. 

Overall, these results imply that matched specialists are able to interpret and fill in 
partial-cue patterns. They respond to partial-cue patterns by increasing the assessed risk of 
misstatement. Mismatched specialists do not suspect the seeded misstatement under these 
conditions. This suggests that identification of incomplete patterns of misstatement may be 
a comparative advantage for industry-specialist auditors. 

My study contributes to the audit-knowledge literature by expanding our understanding 
of the conditions under which performance will improve due to industry specialization. 
Previous research has demonstrated that matched industry-specialist auditors know more 
non-error explanations for unusual fluctuations than mismatched specialists (Solomon et al. 
1999). This gives rise to the concern that industry specialists may "explain away” anom- 
alous facts as non-errors, rather than seriously consider them as indicators of misstatement. 
However, in a small sample study, Wright and Bedard (2000) report that auditors with high 
domain experience do concentrate on error hypotheses. I provide additional evidence that 
matched industry specialists develop more complete problem representations, use these 
representations to assess higher likelihoods of misstatement, and specify discriminatory 
procedures to determine whether the seeded misstatement exists. 
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My study contributes to the pattern-reco gnition literature by demonstrating that 
industry-specialist auditors are able to fill in the missing pieces of a partial-cue pattern. 
Previous research on pattern recognition studied auditors' ability to identify the significance 
of complete patterns indicative of misstatement leaving open the question of whether au- 
ditors identify the significance of incomplete patterns of misstatement. Interpreting incom- 
plete patterns is arguably a more difficult task and one that is likely to assist in identification 
of misstatement concealed by a client. 


APPENDIX A 
State and Local Government Instrument!5 
City of Middletown Case 


You have been assigned the revenue section of the general-purpose financial statement 
audit of the City of Middletown for the June 30, 2001 fiscal year. The City of Middletown 
has been a client of the firm for the past eight years. During that time there has been little 
turnover in the controller's office staff; the personnel are competent and it is easy to work 
with them. Adjustments, when necessary, have been made with little conflict. 

The City's (unaudited) total revenue in the General Fund for the year ended June 30, 
2001 is $25,211,000. Materiality has been set at $125,000 on this engagement. 

You are currently focusing on the revenue in the Capital Projects Fund. Total (unau- 
dited) revenue in the Capital Projects Fund is $904,000. The revenue comes from two 
sources: a Federal Government Grant for $250,000 and Sewer Benefit Tax revenue of 
$654,000. 

The Federal Government Grant received during fiscal 2001 was for renovation of two 
park buildings and surrounding grounds. The grant period is January 1, 2001 to December 
31, 2002, the period during which the improvements must be completed; any money un- 
spent at the end of this period must be returned. The grant totals $3,420,000; the renovations 
to the buildings and grounds are expected to cost $4,275,000. The City has transferred the 
$855,000 of funds necessary to complete the project. They broke ground on the improve- 
ments in March 2001. 


* The City has incurred $1,250,000 in costs on the project as of June 30; this is 29.24 
percent of the project's current expected cost.!" 

* The City receives quarterly payments of $427,500 each on the 10th day of each 
calendar quarter; the City received payments this year on January 10 and April 10. 

* The City uses a 50-day availability period for revenue recognition. 


The Sewer Benefit Tax is collected from all city property owners to pay for sanitary 
sewer improvements. The sewer tax is billed annually and is based on the volume of water 
that is used and returned to the City's sanitary sewer system. Bills are sent on February 1 
of each year, with payments due by March 15. Revenues from this tax are restricted as to 


is The issue in this case is how much of the Federal Government Grant revenue should be recognized during the 
June 2001 fiscal year. This reimbursement-driven grant is a Voluntary Nonexchange Transaction and "recipients 
should recognize revenues in the period when all applicable eligibility requirements have been met and the 
resources are available" (GASB Мо. 33, 130, GASB 1998). There are four eligibility requirements specified in 
the standard: required characteristics of recipients, time requirements, reimbursements, and contingencies (GASB 
No. 33, 120) and they have all been met by this grant. Three payments, or $1,282,500, of this grant are available 
as of June 30 because of the City's 60-day availability policy. The eligibility requirements limit the revenue 
recognition to the amount of reimbursable costs that have been incurred to date, here $1,250,000. Consequently, 
the client should recognize $1,250,000 of grant revenue in this case, representing a misstatement of $1,000,000. 

Note that this is the full-pattern condition government case. The partial-cue (no-cue) pattern condition case 
included the first two (none of the) bulleted points in the fourth paragraph. 
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use for payment for sanitary sewer improvements and maintenance. The City collected 
$654,000 and $621,000 in sanitary sewer tax during the fiscal years ended June 30, 2001 
and 2000, respectively. The accounts receivable balances for uncollected Sewer Benefit Tax, 
net of estimated uncollectible balances, are $47,000 and $43,000 at June 30, 2001 and 
2000, respectively. 


APPENDIX B 
Banking Instrument!* 
Springfield Bank Case 

You have been assigned to the December 31, 2000 audit of Springfield Bank, a regional 
bank in Springfield, NJ. Springfield Bank has been a client of the firm for the past eight 
years. There has been little turnover in the accounting staff during this time period. The 
bank's management and accounting staff are competent, and it is easy to work with them. 
Additionally, on previous engagements management has readily agreed to make the audit 
adjustments the firm has determined to be necessary. 

Current year (unaudited) interest revenue totals $82.6 million. Materiality for this en- 
gagement has been set at $400,000. 

You are currently focusing on the fee revenue. Fee revenue for the current year totals 
$1,357,000 and comprises 14.5 percent of the bank's net income of $9,385,000. Fee revenue 
comes from two sources: loan fees of $910,000 and credit card fees of $447,000. 

Loan fees are collected when a borrower applies for a mortgage; the fees are nonre- 
fundable. During 2000, Springfield received about 24,200 mortgage loan applications. To 
originate a mortgage, Springfield incurs various costs. These include costs associated with 
the application, appraisal, credit report, underwriting, and closing. Springfield prices its 
loan fees to be competitive in its market. 


e During 2000, Springfield closed approximately 23,000 mortgage 1оапз.19 

* The loan fee price of $375 covers the costs associated with originating a mortgage 
and provides a small surplus of about $20 per loan. 

* When mortgage loans close, loan fee revenue equal to the cost associated with 
making the loans is deferred. 


Credit card fee revenue totaled $447,000 during 2000. The fees cover servicing and 
maintaining credit card accounts and other services offered to the credit card holders. Credit 
card fees are billed annually to card holders on the anniversary of the dates on which they 
opened their accounts. Total fees billed to credit card customers during 2000 were $452,000. 
The fees are deferred and recognized on a straight-line basis over the period the fee entitles 
the cardholder to use the card. Deferred credit card fees, net of estimated uncollectibles, 
total $222,000 and $217,000 at December 31, 2000 and 1999, respectively. 


18 The issue in this case is how much of the loan fee revenue should be recognized during the year. These loan 
fees are direct loan origination fees and as such they should be deferred and recognized over the life of the loan 
as an adjustment to yield (interest income). Additionally, direct loan origination costs must be deferred and 
recognized as a reduction in loan yield. The loan fees and costs must be offset and the net amount should be 
deferred and amortized (SFAS No. 91, 15, FASB 1986). Consequently, loan fee revenue is overstated by $460,000 
in this case. 

19 This is the full-pattern condition banking case. The partial-cue (no-cue) pattern condition case included the first 
two (none of the) bulleted points in the fourth paragraph. 
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ABSTRACT: We Investigate the risk relevance of the standard deviation of three per- 
formance measures: net income, comprehensive income, and a constructed measure 
of full-fair-value income for a sample of 202 U.S. commercial banks from 1996 to 2004. 
We find that, for the average sample bank, the volatility of full-falr-value income із more 
than three times that of comprehensive income and more than five times that of net 
Income. We find that the Incremental volatillty in full-fair-value income (beyond the 
volatility of net income and comprehensive Income) із positively related to market- 
model beta, the standard deviation In stock returns, and long-term interest-rate beta. 
Further, we predict and find that the Incremental volatility іп full-fair-value Income (1) 
negatively moderates the relation between abnormal earnings and banks' share prices 
and (2) positively affects the expected retum implicit In bank share prices. Our findings 
suggest full-fair-value income volatility reflects elements of risk that are not captured 
by volatility in net income or comprehensive Income, and relates more closely to 
capital-market pricing of that risk than either net-income volatility or comprehensive- 
income volatllity. 
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I. INTRODUCTION 

nder current United States generally accepted accounting principles (GAAP) and 
| | International Financial Reporting Standards (IFRS), net income and comprehensive 
income include unrealized fair-value gains and losses on some financial instruments 
but exclude them on the majority of financial instruments. In recent years, U.S. and inter- 
national accounting rule makers have discussed whether to require income recognition of 
fair-value gains and losses on a more complete set of financial and derivative-financial 
instruments. Net income, comprehensive income, and ''full fair-value income" (FFV in- 
come) can portray firm performance and risk differently for firms with significant exposure 

to changes in fair values of financial instruments. 

Commercial banks' balance sheets consist almost entirely of financial instruments. The 
current income-measurement regime produces reported-income numbers that exclude un- 
realized fair-value gains and losses on financial instruments that are integral to banks' core 
operations and risk-management strategies, including held-to-maturity securities, loans, de- 
posits, other financial liabilities, and certain types of derivatives. Given the importance of 
these financial instruments and derivatives for assessing bank performance and risk, we 
construct a measure of FFV income that includes unrealized fair-value gains and losses on 
all of these financial instruments and derivatives for a sample of U.S. commercial banks. 
We then address three research questions on the risk-relevance of the time-series volatilities 
of banks' net income, comprehensive income, and our measure of FFV income. The ob- 
jective of the paper is to evaluate how well these three measures of income volatility 
represent commercial bank risk. 

Our first research question asks: How do the volatilities of these three income measures 
compare? These comparisons are interesting because, during the last decade, bank managers 
and their representatives have made conflicting assertions regarding the usefulness of in- 
come measures that include changes in fair values of financial instruments and derivatives. 
For example, in the mid-1990s banking organizations criticized the Financial Accounting 
Standards Board's (FASB) proposal for comprehensive-income measurement and reporting 
because it would fail to fully reflect banks’ risk-management activities, trigger **excessive" 
volatility in comprehensive income, and lead investors to overstate their risk assessments 
(Hirst et al. 2002). In striking contrast to their prior statements, these same organizations 

. recently defended the recognition of fair-value changes on a subset of financial instruments 
in the currently reported measure of comprehensive income as being the “optimal means 
of reporting financial performance" for banks. They asserted that FFV accounting will 
provide a “false and misleading picture” of excessive volatility in banks’ profitability (Dean 
2000, 3) and will misrepresent banks' risk because it is irrelevant and unreliable (Joint 
Working Group of Banking Associations on Financial Instruments 1999). 

If banks effectively hedge at an entity-wide level the risk factors that determine fair- 
value gains and losses on financial instruments, and if comprehensive income recognizes 
fair-value gains and losses on only a subset of financial instruments, then FFV income may 
be less volatile over time than comprehensive income. In contrast, if comprehensive income 
only partially captures banks' exposures to fair-value gains and losses, then FFV income 
may be more volatile over time than comprehensive income. In either case, we predict that 
FFV income is a more complete representation of the outcomes of banks' risk-management 
activities; therefore, volatility in the time-series of FFV income should be more indicative 
of banks' underlying risks. 

Our second research question addresses whether the three different measures of income 
volatility are associated with market-based risk measures. To address this question, we 
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examine the associations between the three income-volatility measures and four stock- 
market-based risk factors (1.е., market-model beta, short-term and long-term interest-rate 
betas, and stock-return volatility), as well as two disclosed measures of banks’ exposure to 
market-based risks (i.e., total exposure to derivatives and the gap between fixed-rate interest- 
bearing assets and fixed-rate interest-bearing liabilities that reprice within one year). To the 
extent these market-based risk measures represent systematic- and total-risk factors, our 
analysis provides evidence on the risk-related information content of each income-volatility 
measure. If net income and comprehensive income do not fully reflect the fair-value gains 
and losses that are triggered by banks’ overall risk exposure, then volatility based on these 
two income measures will misrepresent banks’ risks. We predict that, because FFV in- 
come includes fair-value changes for nearly all financial instruments, volatility in FFV 
income should be a more complete proxy for systematic and total risk, and therefore should 
be more closely associated with banks’ market-based risk measures than either volatility in 
net income or comprehensive income. 

Our third research question investigates the extent to which the по components 
of the three income-volatility metrics moderate the capitalization of earnings in banks’ share 
prices and explain capital-market pricing of bank risk. If the income-volatility metrics 
capture risk factors that the capital markets price, then higher volatility should be associated 
with greater risk and therefore higher expected returns, which should result in lower cap- 
italization of earnings in share price, all else equal. We address this question in two stages. 
First, we examine the extent to which cross-sectional variation in abnormal-earnings- 
valuation multiples is conditional on each of the income-volatility metrics within a residual- 
income-valuation model. We predict that abnormal-earnings-valuation multiples implicit in 
bank share prices are more negatively related to volatility in our unreported-but-more- 
complete FFV-income measure, and less negatively related to volatility in reported-but- 
incomplete net income and comprehensive income. Second, we examine the extent to which 
expected returns implicit in bank share prices are conditional on each of the income- 
volatility metrics. We predict that bank share prices reflect required rates of return on bank 
equity that are increasing in the incremental volatility in FFV income, beyond the volatility 
in net income and comprehensive income. 

To address our research questions we construct a panel of 202 U.S. commercial banks 
with complete fair-value information and stock-price data for the years 1996—2004. For 
each bank, we obtain three measures of income—reported net income, reported compre- 
hensive income, and our measure of FFV income, which recognizes fair-value gains and 
losses on virtually all financial instruments in income in the periods when they arise. We 
estimate volatility by calculating the standard deviation of the time-series of each of the 
three income measures (as a percent of average total assets) for each bank. 

With respect to our first question, we find that FFV-income volatility is signifi- 
cantly greater than net-income volatility for 90 percent of our sample banks, and sig- 
nificantly greater than comprehensive-income volatility for 77 percent of our sample banks. 
For the majority of our sample banks, the fair-value adjustments banks recognize in com- 
prehensive income do not covary significantly negatively with (i.e., do not appear to hedge) 
the incremental fair-value adjustments that banks would recognize in FFV income. Fur- 
ther, the volatility of the incremental FFV-income adjustments greatly exceeds the volatility 
of fair-value adjustments recognized in comprehensive income. 

Regarding our second research question, we find that the volatilities of all three income 
measures exhibit varying degrees of positive correlation with market-risk factors and dis- 
closed measures of market-risks. In particular, net income volatility exhibits the most con- 
sistent and robust correlations across the risk factors we examine, and correlates most 
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strongly with the standard deviation in stock returns. We find that FFV-income volatility 
correlates positively with the standard deviation in stock returns and banks’ exposure to 
derivatives. We also find that the incremental volatility in FFV income (beyond volatility 
in net income and comprehensive income) is positively associated with the standard devi- 
ation in stock returns, market-model beta, and long-term interest-rate beta, even after con- 
trolling for banks’ disclosed measures of total exposure to derivatives and the gap between 
fixed-rate interest-bearing assets and fixed-rate interest-bearing liabilities that reprice within 
one year. | 

Finally, to address our third research question, we predict and find that the incremental 
volatility in FFV income negatively moderates the capitalization of abnormal earnings in 
bank share prices. We also predict and find that expected returns on bank equity are in- 
creasing in the incremental volatility in FFV income. We observe these results in bank 
share prices and expected rates of return after controlling for the effects of the incremental 
volatilities in net income and comprehensive income and after controlling for the infor- 
mation in banks’ disclosures about exposure to market risks related to the asset-liability 
repricing gap and derivatives. 

Taken together, our results suggest that the majority of our sample banks are not fully 
hedged against year-to-year changes in fair values of their reported financial instruments, 
and that volatility in net income and comprehensive income provide an incomplete picture 
of their fair-value-risk exposure. The results show that, for our sample of banks during our 
study period, the volatility of incremental FFV income captures elements of bank risk that 
the capital markets price, but that the volatilities of net income and incremental compre- 
hensive income omit. We believe these findings are relevant for U.S. and international 
accounting standard setters as they consider whether to develop standards that would rec- 
ognize in income fair-value changes for all financial instruments.! These findings are also 
useful to capital-market participants and researchers interested in explaining the relations 
among banks' accounting numbers, share prices, and risk. The findings in this paper are 
also useful for bank regulators as they evaluate the role of fair-value measurement in mon- 
itoring bank capital adequacy. Academic research has established the risk-relevance of net- 
income volatility (e.g., Beaver et al. 1970), the value-relevance of fair values for a subset 
of financial instruments (e.g., Barth 1994), and the risk-relevance of market risk-related 
disclosures (e.g., Schrand 1997). We extend these three lines of research by demonstrating 
the risk-relevance of FFV-income volatility. 

We organize the paper in the following sections. In Section П, we motivate our research 
questions with the institutional features of fair-value accounting for banks and the prior 
research literature. In Section Ш, we describe our sample selection procedures and our 
variables. In Section IV, we describe three sets of tests and present the results. In Section 
V, we provide concluding remarks. 


II. INSTITUTIONAL BACKGROUND AND LITERATURE REVIEW 
Income measurement is a central function of accounting and is controversial. Current 
and proposed U.S. and international accounting standards include (at least) two different 


' We cannot generalize these finding to settings in which full fair-value income measurement is mandatory and 
bank managers face explicit contracting and regulatory constraints that depend on full fair-value income numbers. 
We do not purport that these results demonstrate the desirability of full fair-value accounting. Our evidence on 
the risk-relevance of full fair-value income volatility is only one of many facets of full fair-value accounting 
that standard setters might weigh in their deliberations. For a more comple:e debate and critique of the strengths 
and limitations of drawing inferences for accounting standard setters from capital-markets-based accounting 
research, see Holthausen and Watts (2001) and Barth et al. (2001). 
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measures of income—net income and comprehensive income—each including unrealized 
fair-value gains and losses on different subsets of financial instruments (e.g., Statement of 
Financial Accounting Standards (SFAS) No. 130, FASB 1997). The Joint Working Group 
of Standards Setters (JWGSS) was formed in 1997 for the sole purpose of developing a 
coherent framework for reporting financial instruments at fair value. Although it is not a 
standard-setting body, the JWGSS developed a proposal to require full fair-value accounting 
for most financial instruments (JWGSS 2000). The FASB and the International Accounting 
Standards Board (IASB) are considering requiring income-statement recognition of fair- 
value gains and losses on all financial instruments, which would result in a FFV-income 
measure and the most complete fair-value accounting standards to date.? 

The banking industry is in the center of the controversy over the relative usefulness of 
fair-value-based income measures because commercial banks' balance sheets consist almost 
entirely of financial instruments. For banks, these three different income metrics could 
provide strikingly different pictures of profitability and income volatility over time. For 
example, net income includes many (but not all) effects of core-banking activities, including 
interest revenue and expense, credit losses (i.e., loan-loss provisions), realized gains and 
losses on investments, operating expenses, fees, and taxes. Comprehensive income is a more 
complete measure of performance than net income because it also includes fair-value gains 
and losses on available-for-sale investment securities? However, comprehensive income is 
an incomplete measure of bank performance because it does not include fair-value gains 
and losses on financial instruments such as held-to-maturity securities, loans, financial li- 
abilities, and non-term deposits.* These additional fair-value gains and losses are important 
elements of bank risk and profitability, and would be included in measures of FFV income. 

Our paper advances three lines of academic research investigating the relations between 
reported income measures, fair values, risk, and share values. First and most generally, for 
a sample of non-bank firms, Beaver et al. (1970) establish the risk-relevance of earnings 
volatility and document that earnings volatility and earnings beta explain market model 
beta. More recent studies establish links between accounting-based risk measures and the 
pricing of risk in expected returns and share prices. 

Second, studies based on SFAS Nos. 107 and 115 (FASB 1991, 1993) document the 
value-relevance of fair values for certain balance-sheet components, but obtain mixed results 
on the association between fair-value gains and losses and stock returns (e.g., Barth 1994).$ 
"Third, recent studies examine the risk-relevance of various disclosures, such as interest-rate 
gap, notional amounts of derivatives, and the Securities and Exchange Commission's (SEC) 


2 Tn this paper, our references to ЕРУ accounting are consistent with the financial-instruments focus of the JWGSS 
al. 


propos 

3 Under current accounting rules, reported comprehensive income also reflects the effects of other items, specif- 
ically unrealized gains and losses on foreign-currency translation adjustments, minimum-pension liability ad- 
justments, and unrealized gains and losses from derivatives qualifying as cash-flow hedges. For the average 
bank, these items are typically less material than unrealized gains and losses on available-for-sale securities. 

* Prior to application of SFAS No. 133 (.е., fiscal years ending before 2001, FASB 1998), net income and 
comprehensive income also exclude fair-value gains and losses on derivative positions a bank may hold to 
manage interest-rate risk. 

5 For more recent examples, see Elgers (1980), Fama and French (1992), Feltham and Ohison (1999), Claus and 
Thomas (2001), Gebhardt et al. (2001), Botosan and Plumlee (2002, 2005), Easton et al. (2002), and Baginski 
b qum (2003). For a comprehensive review of the literature linking accounting numbers and risk, see Ryan 
(1 


a 


For additional examples, все Petroni and Wahlen (1995), Ahmed and Takeda (1995), Barth et al. (1996), Eccher 
et al. (1996), Nelson (1996), Dhaliwal et al. (1998), and Cahan et al. (2000). 
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Financial Reporting Release No. 48 market-risk disclosures, which relate to fair-value- 
income volatility (e.g., Schrand 1997).7 Our study links and extends these three prior lines 
of research in that this is the first study to use a comprehensive set of fair-value gains and 
losses to construct full fair-value income, and to assess the risk-relevance of different mea- 
sures of income volatility. 

Our study most closely relates to Barth et al. (1995). Using a 1971—1990 sample period, 
which precedes the effective dates for SFAS No. 107 and SFAS No. 115, Barth e: al. (1995) 
examine the incremental volatility in income induced by including unrealized fair-value 
gains and losses on all investment securities disclosed in bank regulatory reports. Barth et 
al. (1995) find that volatility in their constructed, partial-fair-value income metric is signif- 
icantly greater than volatility in reported net income, but that the incremental volatility in 
partial-fair-value income is not priced as risk. They point out that their results may be 
attributable to incomplete measurement of fair values in income, because banks did not 
disclose fair-value gains and losses on financial instruments other than investment securities. 

Our study also closely relates to prior experimental research in Hirst et al. (2004), 
which investigates the effects of alternative fair-value income measures for banks, and finds 
that risk assessments and valuation judgments of bank analysts are more likely to distinguish 
between high and low risk banks when banks measure and recognize all fair-value gains 
and losses in a FFV income statement. Because Hirst et al. (2004) limited their investigation 
to analysts’ judgments, they do not make inferences about the relation between fair value 
information and market-determined security prices. 


Ш. SAMPLE AND DESCRIPTIVE STATISTICS 
In this section we describe our sample selection procedures, variables, and tests. We 
also discuss descriptive statistics for our sample data. 


Sample Selection 

We draw our sample from the set of U.S. bank holding companies filing Y9-C reports 
with the Federal Reserve and financial statements with the SEC. from 1995—2004. We 
collect accounting data, including fair-value footnote disclosure data, from Y¢-C reports 
and the SNL DataSource database. When fair-value data items are missing “rom these 
sources, we hand-collect them directly from banks’ Forms 10-K and Y9-C reports. We 
collect stock prices and returns from CRSP. Our final sample comprises the set of 202 
banks for which we could collect complete data for the ten years 1995—2004.* We use 1995 
data for differencing, leaving nine years of data to calculate firm-specific standard deviations 
of income and other variables of interest. In total, we have a sample of 1,818 >ank-years 
for our tests. Because of these data requirements, our sample inevitably excludes banks that 
were acquired, failed, started up, or went public during our sample period, and banks that 
do not file Form 10-K (typically smaller, privately held banks). 

Table 1 provides descriptive statistics that indicate that our sample banks represent a 
broad cross-section, with market value of equity over the nine-year period ranging from $2 
million to $190 billion, and average total assets ranging from $41 million to $1,045 billion. 
For comparison, we examined descriptive statistics (untabulated) for the sample of banks 


7 In а related paper, Hodder (2005) documents the value-relevance of banks’ market risk discloscres based on 
cash flow and fair-value volatilities. 

* For the subset of our tests that examine expected rates of return implicit in bank share prices, we also collect 
consensus analyst earnings forecast data from the Institutional Brokerage Estimates System (I/B/E/S). Because 
not all sample banks are followed by analysts that report their forecasts to I/B/E/S, this date requirement 
further restricts our sample for these tests to the subset of (larger) sample banks with available fcrecast data. 
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that met our accounting and stock-price data requirements for at least one year during 
1996—2004. In general, for the variables of primary interest, our sample appears represen- 
tative of the larger population of publicly held banking firms, diminishing concerns about 
survivorship and size bias. 


Income Measures 

We construct our measure of FFV income to be as complete a measure of FFV income 
as possible, encompassing fair-value accounting for all financial instruments. We compute 
our measure of FFV income by adjusting comprehensive income for unrealized fair-value 
gains and losses disclosed under SFAS Nos. 107 and 133 for held-to-maturity investment 
securities, loans, term deposits, long-term financial liabilities, and derivatives. Using Form 
Y9-C regulatory report data, we also adjust comprehensive income for unrealized fair-value 
gains and losses on non-term deposits. Fair values of non-term deposits are »articularly 
difficult to estimate. We estimate their fair values using a static discounted cash flow ap- 
proach used by bank regulators, together with firm-specific deposit data obtained from bank 
holding company Y9-C reports. Finally, to be consistent with the after-tax measures of 
comprehensive income and net income, we measure all of our fair-value adjustments after 
tax effects using the statutory Federal income tax rate. We describe our variable compu- 
tations in detail (particularly the fair-value computations for non-term deposits) in the 
Appendix. 

Table 2 contains descriptive statistics for the three primary income measures and se- 
lected components of income. We present the variables in the order in which the income 
components and totals might appear in a typical bank income statement if the bank were 
to report FFV income. For example, the panel begins with net interest income, the least 
inclusive income measure, and concludes with FFV income, the most inclusive. We express 
all variables as a percentage of average total assets each year to allow comparison across 
size within our cross-section of banks.? Panel A contains descriptive statistics for the full 
sample of 202 banks over 1996-2004 (i.e., n = 1,818). Panel B contains annual means for 
the full sample for each year 1996—2004. 

The statistics in Panel A of Table 2 indicate that the pooled nine-year means (medians) 
of these three income measures have statistically indistinguishable magnitudes as a percent 
of average total assets: 1.133 (1.136) percent for net income versus 1.149 (1.172) percent 
for comprehensive income versus 1.026 (1.069) percent for FFV income (test results not 
tabulated). The descriptive statistics in Table 2, Panel B reveal significant differences across 
income measures in certain years. For example, for the mean bank in 1998 and 2000, years 
in which interest rates fell, comprehensive income as a percent of average total assets was 
much higher than full fair-value income as a percent of average total assets, because FFV 
income included the fair-value losses on financial liabilities, derivatives, and non-term de- 
posits, which offset fair-value gains on financial assets.!° In 1999, by contrast, a year in 
which interest rates jumped, the average bank experienced large unrealized fair-value losses 


? We do not adjust average total assets for FFV gains and losses. This choice increases the FFV-income volatility 
metrics but only very slightly because net unrealized fair-value gains and losses tend to be a very small per- 
centage of total assets for the typical bank, even though they can be large relative to net income. 

19 To gauge the economic significance of the reported differences in income measures when scaled by average 
total assets, consider that the average total assets for the mean bank-year in our sample is $15,745.66 million. 
Therefore, a 0.10 percent difference in income scaled by average total assets equates to a $15.746 million 
difference in income for the mean bank-year. Also, because banks are typically highly leveraged, a 0.10 percent 
difference in income scaled by average total assets equates to а 1.22 percent difference in return on equity for 
the mean bank-year. 
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on available-for-sale investment securities, so comprehensive income (0.656 percent of av- 
erage total assets) was much lower than net income (1.116 percent of average total assets). 
That year, the mean sample bank also experienced additional substantial fair value losses 
on financial assets, but these were partially offset by fair-value gains on financial liabilities 
and non-term deposits, so FFV income (0.728 percent of average total assets) presents a 
more favorable profit picture than comprehensive income. 

Figure 1 graphically depicts mean net income, comprehensive income, and FFV income 
as a percentage of average total assets for the sample of banks each year and pooled across 
time, together with percentage changes in interest rates during the sample period. The time- 
series of mean net income is the smoothest of the three income series, while comprehensive 
income and FFV income exhibit much wider year-to-year changes. Howeve-, as noted 
above, the pooled nine-year means of these three income measures are similar. These data 
show that, in any given year, exposure to interest-rate changes and other risks triggered 
dramatic differences in the three income measures; but, on average, fair-value edjustments 
in both comprehensive income and FFV income tended to zero out over this period. Com- 
paring the plots to percentage changes in interest rates, net income appears relatively 
invariant to rate changes. In contrast, comprehensive income covaries negatively with 
interest-rate changes and FFV income covaries positively with interest-rate changes. 


IV. EMPIRICAL TESTS AND RESULTS 
In this section, we describe our empirical tests and results. The section contains three 
parts, corresponding with our three research questions. 


Income Volatility 

The pattern of differences in volatility across income measures based on pooled data 
in Figure 1 and Table 2 is also evident in the firm-specific volatility analysis in Table 3. 
Panel A of Table 3 provides descriptive statistics for the firm-specific standard deviations 
of the three income measures (stated as a percentage of average total assets) across the 202 
sample banks over the years 1996—2004. The average sample bank experienced a standard 
deviation in net income of 0.260 percent and a standard deviation in comprehensive income 
of 0.394 percent. The standard deviation in FFV income for the average sample bank was 
1.442 percent, more than five times larger than the standard deviation in net income and 
more than three times larger than the standard deviation in comprehensive income. The left 
side of Figure 2 graphically depicts the means of the firm-specific standard deviations of 
our three income measures. The remainder of Figure 2 includes annual cross-sectional 
standard deviations (from Table 2, Panel B) and the pooled cross-sectional standard devi- 
ations (from Table 2, Panel A), which show that the standard deviation in FFV income is 
greater than that in comprehensive income and net income in each year, 1996 through 2004. 

Panel B of Table 3 reports the results of firm-specific tests of the relative volatility of 
our three income measures. The first row compares the standard deviations of net income 
and comprehensive income. Of the 202 sample banks, 186 (92.1 percent) experienced 
greater volatility in comprehensive income than net income. For 69 sample banks (342 
percent), firm-specific F-statistics reject (p < 0.05) the null hypothesis of equal variance 
(median F-statistic — 2.2). The second row of Panel B compares the standard deviations 
of net income and FFV income. The results indicate that FFV income is more volatile than 
net income for 200 sample banks (99.0 percent). For 182 banks (90.1 percent), firm-specific 
F-statistics reject (p « 0.05) the null hypothesis of equal variance (median F-statistic 
= 21.7). In the last row of Panel B, we compare standard deviations of FFV income 
and comprehensive income. FFV income is more volatile than comprehensive income for 
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194 banks (96.0 percent), with 155 banks (76.7 percent) experiencing significantly greater 
volatility in FFV income (median F-statistic = 7.6). 

For FFV-income volatility to be lower than comprehensive income volatility, the vari- 
ance of the FFV adjustments would have to be less than twice the negative covariance 
between comprehensive income and the FFV adjustments.!! Our analysis of the covariance 
structure among components of FFV income, summarized in Table 3, Panels C and D, 
suggests that this is not the case. Table 3, Panel C reports descriptive statistics of the firm- 
specific correlations among the income adjustments necessary to adjust from net income to 
comprehensive income to FFV income. These correlations indicate that for the average 
sample bank, comprehensive income adjustments are uncorrelated with net income. The 
correlations also show that for the average sample bank, FFV adjustments are uncorrelated 
with comprehensive income and with comprehensive income adjustments. For 15 sample 
banks, however, FFV adjustments are significantly negatively correlated with comprehen- 
sive income adjustments implying that for these banks FFV adjustments capture a signifi- 
cant degree of hedging of the fair value gains and losses that impact comprehensive income. 
However, for seven other banks FFV adjustments are significantly positively correlated with 
comprehensive income adjustments implying FFV adjustments reveal that these banks are 
exposed to a significant degree of additional risk that is not captured in the fair value gains 
and losses that affect comprehensive income. 

To further explore how various fair-value income components covary, we report in Table 
3, Panel D descriptive statistics on firm-specific correlations among fair-value gains and 
losses of individual components of income that comprise the necessary adjustments to 
compute comprehensive income and FFV income. These statistics indicate that realized and 
unrealized gains and losses on available-for-sale investment securities for the average bank 
are essentially uncorrelated. Not surprisingly, unrealized gains/losses on available-for-sale 
investment securities (a component of comprehensive income) correlate positively with 
comprehensive income for the majority of sample banks. Comprehensive income also co- 
varies positively with unrealized gains/losses on financial assets that are not recognized at 
fair value (e.g., held-to-maturity securities and loans) for 154 sample banks (76 percent of 
the sample; 15 significant), and negatively with unrealized gains/losses on financial liabil- 
ities (e.g., term deposits and long-term financial liabilities) for 138 sample banks (68 percent 
of the sample; 14 significant). Moreover, 164 of 202 sample banks (81 percent) experience 
negative correlation between unrealized gains/losses on financial assets and financial lia- 
bilities, including 65 banks for which it is significantly negative. Finally, these descriptive 
statistics suggest that unrealized gains/losses on non-term deposits correlate negatively with 
comprehensive income for 181 sample banks (90 percent of the sample; 30 significant). 
Together, these descriptive statistics on correlations among fair-value income components 
indicate that volatility in comprehensive income does not fully capture the complex co- 
variation among the various financial assets and liabilities that comprise bank balance sheets 
and risk-management strategies. 


!! Because comprehensive income (CI) is equal to net income (NI) plus other comprehensive income items (OCT) 
the variance in comprehensive income can be expressed as 022 = o^, + са + 2Cov(NI, ОСІ), In similar 
fashion, our measure of FFV income is equal to comprehensive income plus all other fair-value adjustments 
(ЕМА), so the variance in FFV income (FEVI) can be expressed as: сред = о? + од + 2Cov(Cl, ЕМА). 
The extent to which ову differs from о: depends on the relative magnitude of o7,,, and the sign and 
magnitude of Сох(СЕ ЕУА). The Cov(CI, ЕМА) term represents the covariance between comprehensive income 
and all other FVA items (1.е., fair-value gains and losses on all other financial instruments). 
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Tests of the Relations between Income Volatility Measures and Market-Based 
Risk Measures 

We assess whether our income-volatility measures capture market-based risk factors by 
examining their associations with stock-market-based risk measures and banks’ disclosed 
measures of exposure to market risks. We estimate the correlations between each of the 
three income-volatility measures and four measures of banks’ stock-market-based risk: mar- 
ket model beta, total volatility in stock returns, and the absolute values of short-term and 
long-term interest-rate betas. Using data from banks’ market risk disclosures, we also ex- 
amine whether our income-volatility measures correlate with two proxies for banks’ 
exposure to market risks: total exposure to derivatives and the absolute value of the excess 
of fixed-rate assets over fixed-rate liabilities subject to repricing within one vear. If the 
income volatility measures capture elements of market-based risks, then the correlations 
between these risk measures and our income volatility measures should be positive. Further, 
income volatility measures that are more complete measures of banks’ market rsks should 
correlate more positively with market-risk measures. 

For this analysis and most of the remaining analysis of the paper, we estimate our three 
income volatility measures using standard deviations of each income measure stated as a 
percentage of average total assets over five rolling five-year periods, each ending with years 
2000—2004 (rather than estimating standard deviations over the full nine-year period 1996— 
2004 as in Table 3). We adopt a five-year measurement period to allow income volatility 
and market-based risk factors to vary over time, relaxing the restriction that income vola- 
tility and market-based risks are stationary for each bank over nine years. By relaxing 
this restriction, however, we may be adding noise to our tests because standard deviations 
of income over five years may contain more measurement error than stancard devia- 
tions measured over nine years. 

We compute for each bank a market-model beta as a proxy for systematic risk under 
the CAPM. We estimate market model beta by regressing bank stock returns on a market- 
wide index of value-weighted returns over each of the five 60-month periods ending with 
the last month of each year 2000—2004. As a proxy for total risk, we compute the standard 
deviation of raw returns over the same five 60-month periods, each ending with the last 
month of the year 2000—2004. 

Interest-rate risk is a key element of risk for banks, and it relates directly to fair-value 
gains and losses. We estimate two firm-specific interest-rate risk parameters by regressing 
bank stock returns on monthly changes in short- and long-term U.S. Treasury yields over 
each of the five 60-month periods ending with the last month of each year 2000—2004 
(Flannery and James 1984). If a bank has net fixed-rate asset exposure”? (net fixed-rate 
liability exposure) to interest-rate changes, then the bank's share returns should covary 
negatively (positively) with changes in interest rates, leading to more negative ‘more pos- 
itive) interest-rate betas. Thus, either greater net fixed-rate asset or net fixed-rate liability 
exposure could trigger greater volatility in income. Therefore, we test the correlations be- 
tween the three income-volatility measures and the absolute values of short-term and long- 
term interest-rate betas. 

We also examine the relations between our income volatility measures and two dis- 
closed measures of banks’ exposure to market risk (GAP and EXP), measured at the end 


12 Our inferences remain unchanged if we assume that income volatility and market risks are stationary and conduct 
our analyses on firm-specific standard deviations of income during the full nine-year sample period. 

ІЗ Net fixed-rate asset exposure arises when a bank has a greater proportion of fixed-rate assets than fixed-rate 
liabilities. 
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of each year 2000—2004. GAP denotes the absolute value of the excess of fixed-rate assets 
over fixed-rate liabilities subject to repricing within one year, all scaled by total assets. The 
maturity composition of banks’ assets and liabilities is associated with the sensitivity of 
their stock prices to interest rate changes (Flannery and James 1984; Schrand 1997). Be- 
cause GAP risk manifests as change in net interest income, we expect this measure to be 
most closely associated with net-income volatility. We obtain GAP from SNL DataSource. 
SNL obtains GAP data from banks’ Forms 10-K and if GAP is missing from the Form 10- 
K, SNL replaces it with one-year regulatory GAP obtained from banks’ year-end Y9-C 
regulatory reports. 

The second disclosed measure of banks’ exposure to market risk is the total notional 
value of derivatives held at year-end, which we denote EXP. Prior research has documented 
an association between derivatives use and bank stock prices; however, the direction of 
association is unclear. For example, Riffe (1996) finds that the notional amount of swaps 
is positively associated with equity values. Riffe (1996) interprets the positive coefficient 
as evidence that the notional value of swaps is a proxy for higher expected fee income. In 
contrast, Venkatachalam (1996) finds that the notional value of derivatives is negatively 
related to stock prices, after controlling for the fair values of derivatives. Venkatachalam 
(1996) speculates that notional values provide information about risk that may not be fully 
reflected in derivative fair value estimates. Neither of these studies includes income or risk 
measures in tests of association between derivatives disclosures and equity values. In our 
study, EXP equals the sum of the notional values of futures contracts, forwards contracts, 
exchange-written and exchange-purchased options, OTC-written and OTC-purchased op- 
tions, and swaps, all scaled by total assets, at the end of each year 2000—2004. We obtain 
EXP from SNL DataSource, and SNL obtains these data from Y9-C regulatory reports. 

Table 4, Panel A contains descriptive statistics for the four stock-market-based risk 
measures and the two disclosed market-risk measures for our sample of banks for the years 
2000-2004. The mean (median) market model beta is 0.312 (0.299), well below the market- 
wide average of 1.0. Mean (median) absolute values of short-term and long-term interest- 
rate betas are approximately 0.141 (0.101) and 0.225 (0.163), respectively. The mean (me- 
dian) standard deviation in raw returns for our sample banks is 0.051 (0.048). The mean 
(median) repricing GAP in absolute value is roughly 12.6 percent (10.4 percent) of total 
assets. More than half of our sample banks have no exposure to derivatives (EXP), whereas 
the mean bank holds derivatives that amount to roughly 37.3 percent of total assets. This 
skewed distribution is not surprising, given prior findings (e.g., Venkatachalam 1996) that 
larger, more sophisticated banks are more actively involved in derivatives trading than are 
smaller banks.'* 

Table 4, Panel B contains Pearson and Spearman correlation statistics among the three 
income-volatility measures, the four stock-market-based risk measures, and the two dis- 
closed measures of market risk, pooled across years 2000-2004. The most consistent set 
of results emerge from the correlations between the three income-volatility measures and 
the standard deviation of returns, all of which are positive and significant. These results are 
intuitive in that volatility in reported income and volatility in returns are general measures 
of total risk, and are not specific to certain types of risk. These findings confirm that our 
three income-volatility measures capture risk factors that relate to volatility in stock returns. 


14 In untabulated analyses, we find similar descriptive statistics for our four stock-market-based risk measures and 
our two disclosed measures of market risk for the full sample of banks with at least one year of complete 
accounting and stock-market-based variables available during 2000-2004. These descriptive statistics suggest 
our final sample banks have market-risk profiles comparable to the larger population of banks. 
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The results in Table 4, Panel B do not support reliable inferences about whether any 
of the three income-volatility measures correlate with systematic risk (MMBeta). Although 
net income volatility and FFV-income volatility exhibit some evidence of positive corre- 
lation with MMBeta, none of the three income volatility measures exhibits significant pos- 
itive coefficients for both Pearson and Spearman correlations.!? The correlation coefficients 
between the three income volatility measures and the interest rate risk measures suggest 
income volatility covaries positively with interest rate risk. In particular, net income vola- 
tility is consistently positively correlated with both STBeta and LTBeta, and comprehensive 
income volatility is consistently positively correlated with STBeta. The correlations between 
FFV-income volatility and STBeta and LTBeta are mixed. Both Pearson correlation coef- 
ficients are positive and significant (especially with LTBeta) but both Spearman correlation 
coefficients are insignificant. The results are also weak for correlations with net asset- 
liability-repricing risk (GAP). None of the three income-volatility measures has consistently 
significant positive Pearson and Spearman correlation coefficients with GAP. With regard 
to exposure to derivatives (EXP), net-income volatility and FFV-income volatility exhibit 
consistent positive and significant correlations with EXP. Summarizing, these correlations 
indicate that, among our three income-volatility measures, volatility in net income emerges 
as the most robust and consistent measure of risk across the set of risk factors examined 
here. Comprehensive-income volatility appears to reflect a subset of risks, most notably 
volatility in stock returns and short-term interest rate risk. FFV-income volatility reflects 
elements of risk captured by the standard deviation in stock returns and derivatives 
exposure. 

We also explore the incremental market-risk-related information content in our three 
income-volatility measures with two sets of tests.!© First, we examine whether our three 
income-volatility measures provide any market-risk-relevant information beyond the two 
market-risk measures that banks already disclose (1.е., GAP and EXP). Second, we examine 
whether incremental volatility in comprehensive income and FFV income provide market- 
risk-relevant information beyond the volatility in net income, and controls for GAP and 
EXP. To run these tests, we focus on three market risk proxies: MMBeta as a proxy for 
systematic risk, standard deviation in stock returns as a proxy for total risk, and LTBeta as 
a proxy for interest rate risk (each denoted MktRiskProxy in the general models below). 
We estimate the following regression models pooled over the five years 2000—2004: 


MktRiskProxy; = бо + 248200-2003 + SEXP, + & GAP, + басу + 8; (1) 
MktRiskProxy;, = до + 2495555 гюз  8,EXP, + & GAP, + ёоо, + £y; (2) 
MKktRiskProxy, = 8, + Ў, гоз + SEXP, + GAP, + бару + £y; (3) 


+ (сср — сму) + 9,(бүрур — Сеп) + Ee (4) 


Models (1), (2), and (3) test whether any of our three income volatility measures 
provide incremental risk-relevant information beyond GAP and EXP. If so, then 8, should 


15 We do not find consistent evidence of significant positive association between net income volatility and market- 
model beta, inconsistent with prior research on non-bank samples (e.g., Beaver et al. 1970). 
16 We gratefully acknowledge a reviewer for suggesting this analysis. 
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be positive and significant. Model (4) tests whether incremental volatility in comprehensive 
income and/or FFV income provides any market-risk-relevant information beyond the vol- 
atility in net income, and controls for GAP and EXP. If so, then 8, and/or 8, should be 
positive. 

Table 4, Panel C reports the parameter estimates of these models. For the standard 
deviation in stock returns, the results for models (1), (2), and (3) indicate that all three 
income-volatility measures provide incremental risk-relevant information. The results for 
model (4) indicate that incremental volatility in FFV income provides significant incre- 
mental risk-relevant information for total risk in returns, beyond net-income volatility, GAP 
and EXP. The results from all four models for MMBeta indicate that only volatility in FFV 
income provides incremental information for systematic risk. Finally, all four models in- 
dicate that all three income-volatility measures provide incremental information for LTBeta, 
beyond GAP and EXP. The findings for model (4) suggest that volatility in FFV income 
provides the most significant incremental information for LTBeta, as a proxy for interest 
rate risk. 


Tests of Association between Income Volatility Measures and Bank Share Prices 

To address our third and final research question, we conduct two related sets of tests 
of association between volatilities of the three income measures and share prices. Both sets 
of tests provide evidence on our prediction that income volatility is an element of risk that 
increases expected returns and decreases share prices, ceteris paribus. First, we assess the 
share-price implications of each income-volatility measure using a regression approach 
motivated by the residual-income model (Ohlson 1995). Second, we test the implications 
of each income-volatility measure for expected rates of return implied by bank share prices. 


Share-Price Tests Utilizing a Residual Income Approach 

Model (5) is a highly simplified version of the residual-income model (Ohlson 1995). 
We estimate Equation (5) as a benchmark model before introducing any income-volatility 
measures, on a pooled sample over 2000—2004: 


Р, = о + 2405999 2903 + a, BVE, + o, AE, + £y. (5) 


In this equation, P denotes price per share for bank j at the end of t; BVE denotes book 
value of equity per share at the end of year t; and AE denotes period t abnormal earnings 
per share, which we use as a proxy for expected future abnormal earnings. We measure 
abnormal earnings as current period earnings (scaled by the number of shares outstanding) 
less the product of the risk-free rate of return at the beginning of year t times book value 
per share at the beginning of year t. We use the risk-free rate for this computation to allow 
our coefficient estimates to capture the effects of risk. To control for sample-wide mean 
shifts in share prices over our study period, we allow the intercept to shift each year (a;595— 
ои) 

Theoretically, the coefficient estimate оп BVE (а) should equal 1, but omitted variables 
(e.g., off-balance-sheet net asset values and gains or losses that have not yet been realized 
or recognized) may induce this coefficient estimate to deviate from 1. We expect the co- 
efficient on AE (œ) to be positive, reflecting market capitalization of abnormal earnings. 
Because this simplified model only includes abnormal earnings for the current period (rather 


17 The a, coefficient estimate can therefore be interpreted as the intercept for the year 2004, the fifth and final 
year of the sample in this pooled regression. 
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than all future periods), this coefficient should reflect the cross-sectional mean share-price 
effects of persistence, risk, growth, and other information in abnormal earnings (consistent 
with the linear information dynamics assumptions in Ohlson [1995]).18 

To control for information in banks' disclosed measures of exposure to interest-rate and 
market risks, we augment Equation (5) with GAP and EXP. In the model, we interact GAP 
with AE and predict that the coefficient will be negative, as the market assigns a lower 
capitalization multiple to abnormal earnings of banks with greater exposure to interest-rate 
risk (i.e., larger GAP), ceteris paribus. 

As described above, prior research suggests that EXP, our proxy for banks' exposure 
to derivatives, may be associated with higher levels of profitability (Riffe 1996), greater 
risk (Venkatachalam 1996), and may be a proxy for larger banks that are more actively 
involved in derivatives (Venkatachalam 1996). To distinguish the confounding effects of 
higher risk, higher returns, and size associated with derivatives on share prices, we include 
in the regression a main effect for EXP as well as the interaction between EXP and AE. 
We interpret a positive coefficient on EXP as consistent with EXP acting as a proxy for 
larger banks with net unrecorded assets associated with derivatives. We expect the coeffi- 
cient on EXP interacted with AE to be negative because we predict that the capital markets 
assign lower valuation multiples to abnormal earnings for banks with greater exposure to 
derivatives (i.e., greater derivatives exposure is more risky for banks that are less profitable). 
With these controls for banks’ exposure to interest-rate and market risks, we estimate the 
following regression: 


Py = Aq + 240599929 + &,BVE, + e; AE, + (GAP, X АЕ) 
+ a,(EXP, х АБ) + a,EXP, + бу (6) 
We test the share-price effects of the volatilities of the three income metrics by sepa- 
rately interacting each with abnormal earnings, thereby allowing the valuation coefficient 


on abnormal earnings to vary across each income-volatility measure. We estimate the fol- 
lowing models: 


Р, = а) + Хао оз + BVE, + a; AE, + a (GAP, X AE) 


+ a,(EXP, х AE) + EXP, + одох, X АБ) + 5, | (7) 
Pi, = ao + Хазоуо зооз + ОВУ, + о,АЕ, + «,(GAP;, X АБ) 

+ o (EXP, X AB) + а,ЕХР, + а (сс, X AE) + £y (8) 
Р, = Gy + 240599 293 + ОВУ, + e, AE, + a,(GAP, X АВ,) 

+ a EXP, X АБ) + а,ЕХР, + осор X AES) + £g. (9) 


For these models, we compute our income-volatility variables (1.е., Gy, Soy» and Фрруцо 
as bank-specific standard deviations of each income measure expressed as a percentage of 
average total assets. We measure each observation on a rolling basis over the five-year 


15 [n 58 bank-years, abnormal earnings are negative. To avoid the unrealistic assumption of negative abnormal 
earnings in perpetuity, we set abnormal earnings for those bank-years equal to that bank's time-series mean 
abnormal earnings. For the few banks with negative time-series mean abnormal earnings, we set negative ab- 
normal earnings for year t equal to 0.01. 
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periods ending with each year t (2000—2004). In each model, a significant negative a, 
coefficient suggests that the income-volatility metric captures elements of risk that are 
priced in the capital markets, after controlling for GAP and EXP. 

To test the share-price effects of the incremental volatilities of ост and oj, beyond 
cy We run the following regression: 


Р, = ao + Уо з + BVE, + о АВ, + (GAP, X AE) 
+ a EXP, X AE) + o4EXP, + along, Х АБ) 
+ af (Gog — он) X АЕ; + о | (Орруур — суд X АВ, t (10) 


Equation (10) tests whether the capital markets price as risk the incremental volatility 
associated with CI (og, — сул) and FFVI (оъ — сер. Closely related to Equation (10), 
we also estimate the following model using bank share prices at the end of year 2004 (1.е., 
t = 2004) and using income-volatility metrics estimated over the full nine-year sample 
period: 


Py = % + o4BVE, + „АБ, + а (САР, х AE) + a,(EXP, X АЕ) 
+ EXP, + осм. X АЕ) + о (сс — Ом, X АБ] 
+ egl(Opeyi — Soy) X АЕ + Se (11) 


For both Equations (10) and (11), we interpret significant negative a, and a, coefficient 
estimates as evidence that these incremental income-volatility metrics represent elements 
of risk priced by the capital markets. As noted earlier, we predict съъут should be the most 
complete measure of income volatility, and the results in Table 4, Panel C indicate that 
орел Contains significant incremental information for the standard deviation in stock returns, 
market-model beta, and long-term interest-rate beta. Thus, we predict that incremental vol- 
ан у in oye, will have a significant negative association with the capitalization of abnor- 
mal earnings in bank-share prices. 

Table 5 reports the results of estimating models (5) through (11), as well as descriptive 
statistics on variables used only in these regressions (i.e., P, BVE, AE, and AE interacted 
with the volatility metrics). The coefficient estimates on BVE are consistently positive and 
significant, but they are statistically less than the theoretically predicted value of 1 across 
all models except for model (11). The coefficients on AE are consistently positive and 
significant, ranging from 10.19 to 12.60 across models. When interacted with AE, GAP is 
negatively associated with bank share prices, consistent with our predictions that share 
prices reflect lower capitalization multiples to abnormal earnings for banks with greater 
exposure to interest-rate risk. The results also indicate that across models (with the excep- 
tion of model (11)), EXP is consistently positively associated with bank share prices, and 
when interacted with AE, it is consistently negatively associated with share prices. 

With regard to our main predictions, results for models (7) and (9) indicate the coef- 
ficients on the NI and FFVI volatility metrics are negative and significant. In particular, the 
coefficient on the interaction of тєк X AE in model (9) is negative and significant (—0.21; 
t-statistic = —3.54), consistent with сиру; capturing risk-relevant variation in valuation of 


19 This decomposition of standard deviation is consistent with tests гип in prior studies (e.g., Barth et al. 1995) 
and our discussion of the incremental effect of each income-volatility coefficient assumes that variances are 
additive. 
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residual income. By contrast, the coefficient on со X AE in model (8) is negative but not 
significant. This suggests that volatility in CI obscures the risk-relevant information rep- 
resented by volatility in NI and FFVI. 

The results from estimating models (10) and (11) suggest that volatility in FFV income 
captures incremental factors that are associated with the market’s assessment of share-price 
risk, beyond the risk factors represented by volatility in net income and comprehensive 
income, as well as GAP and EXP. Taken further, the significance of the incremental vola- 
tility in Oppy; suggests that сул, сс, GAP and EXP are incomplete measures of income 
volatility and risk exposure for purposes of pricing bank shares.?? 


Tests Utilizing a Cost of Capital Approach 

We predict that volatility in income represents elements of risk that increase expected 
returns and decrease share prices, ceteris paribus. To refine our analysis to focus directly 
on expected returns, we examine whether banks' costs of equity capital are associated with 
each income-volatility measure. If our income-volatility metrics are proxies for elements 
of risk that the capital markets price in expected returns, then they should be positively 
associated with banks' costs of equity capital. 

We derive our cost of equity capital measure using the model presented in Ohlson and 
Juettner-Nauroth (2005) (OJ). Using a dividend valuation model and imposing a number 
of assumptions about near-term earnings and abnormal earnings, OJ derive the following 
earnings-based-valuation formula: 


m eps, — eps, | CoE*dps, | CoE | 
P = eps, ЈЕ : eps, eps, / 12 
9 CoE CoE((1 + CoE) — y) : (12) 


where: 


eps, = forecasted earnings per share in year 1; 
eps, = forecasted earnings per share in year 2; 
dps, = forecasted dividends per share; 
CoE = the implied cost of equity capital; 
y = 1 + the growth rate in abnormal earnings beyond the forecast horizon; and 
P, = value per share. 


Rearranging Equation (12) leads to the following formula for the cost of capital: 


\ 
eps eps, - eps, 
СОЕ = А + ла Ви» (PEA Фа Gy : 13 
Р, m y ) (13) 


20 Consistent with the analysis reported in Barth et al. (1995), we also tested the pricing of each income-volatility 
measure using a рпсе-еапипрв model. The results (not tabulated) are consistent with the results in Table 5. The 
findings indicate that, by itself, сул is negatively related to price-earnings ratios and incrementally negatively 
related to share prices when we include thé volatilities of the two sets of fair-value-income adjustments in the 
regression. The findings indicate that o by itself is not related to ргісе-сагпіпря ratios and (та — оъ) is 
incrementally positively related to price-earnings ratios (opposite the predicted direction) when we include бут 
and/or (ар — Сс) in the regression. Also, consistent with the results in Table 5, the coefficient estimates on 
сред and (орут — Сс) are significantly negative and robust across alternative specifications. 
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where: 


1 . , dps, 
= — + — |. 
^ 2 (9 9 2) 


To compute Equation (13), we use consensus I/B/E/S eps forecasts as of the last date 
for which I/B/E/S compiles forecasts preceding year-end (which they denote as the last 
statistical date) and share prices as of year-end from CRSP. As in Gode and Mohanram 
(2003), we assume y equals 1 + (г, — 3%). 

To test whether banks' equity capital costs are increasing in our income volatility met- 
rics, we run the following regression, again including controls for each bank's disclosed 
exposure to interest-rate and market-risk using GAP and EXP, respectively: 


CoB, = o, + y,GAP, + У,ЕХР, + узом + Ус Ом 
+ уФкруп — Сеп) + бе (14) 


We predict that the coefficient on the control variable for GAP will be positive because we 
predict that the capital markets assign higher expected rates of return when discounting 
earnings for banks with greater degrees of exposure to interest rate risk. We cannot make 
an unambiguous prediction for the sign of the coefficient on EXP in this regression. If EXP 
serves as a proxy for greater profitability and/or larger bank size, then the coefficient 
estimate may be negative, reflecting larger, more profitable banks facing lower expected 
returns. If EXP serves as a proxy for exposure to market risks, then the coefficient estimate 
may be positive reflecting higher expected returns. 

Our primary predictions for these tests are that the coefficients on the incremental 
income-volatility metrics will be positive if the capital markets assign higher expected rates 
of return when discounting earnings for banks with greater volatility in income. Again, as 
in our share-price tests, we predict Oppy, should be the most complete measure of income 
volatility. Thus, we predict that incremental volatility in FFV income will have a significant 
positive association with the cost of bank equity capital. 

We estimate model (14) several ways. First, we estimate it with each annual cross- 
section of sample banks with available I/B/E/S data in years 2000 to 2004.2! For each of 
these models, we compute CoE based on share prices at the end of each year, and we 
compute income-volatility metrics over the five-year period ending with year t. Table 6, 
Columns (a) through (e) present the results for these five annual cross-sectional estimations. 
We also estimate model (14) pooled over the five-year period 2000-2004, and introduce 
separate intercepts for each year 2000—2003 (separate annual intercepts are not tabulated). 
Table 6, Column (f) contains these results. Finally, we estimate model (14) using CoE 
estimates as of the end of year 2004 and firm-specific income-volatility metrics computed 
using nine years of annual income information for our sample from 1996—2004. We present 
those results in Column (g) of Table 6. 

The results in Table 6 indicate that both GAP and EXP are positively associated with 
the cost of equity capital in 2003 and 2004, and in the five-year pooled estimation. Similarly, 
the results indicate that oyy is positively associated with banks’ costs of equity capital in 


21 As noted above, we cannot estimate these models over our full sample of banks because not all sample banks 
are followed by analysts that report their forecasts to I/B/E/S. The subsample varies with forecast data avail- 
ability each year, with the largest sample including 131 banks in 2004. 
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2003, 2004, and in the five-year pooled estimation. The findings suggest that the incremental 
volatility in ос relative to oy, is not related to banks’ costs of equity capital in any of the 
models. In Table 6, the most robust result is that the incremental volatility in az; relative 
to са is significantly positively related to banks’ costs of equity capital in four of the five 
annual cross-sectional tests (years 2000—2003), in the pooled model, and in the cross- 
sectional test for 2004 with volatility metrics measured over the full nine-year period. As 
in our share-price tests, these results are consistent with our prediction that volatility in 
full-fair-value is a more complete representation of bank risk than volatility in either net 
income or comprehensive income, and the cost of bank equity capital is increasing in 
incremental volatility in full-fair-value income.” 


Robustness Tests 

To examine the robustness of our results, we estimated four sets of alternate specifi- 
cations for the abnormal-earnings-model regressions and obtained results (untabulated) sim- 
ilar to those described in the text and Table 5. First, to verify that the share-price tests we 
report in Table 5 hold for the subset of bank-years we use for our cost of capital tests in 
Table 6, we used I/B/E/S analyst forecasts of future earnings as proxies for expected 
earnings in computing AE. The sample sizes for the cost of capital tests are substantially 
: smaller than those in the share price tests in Table 5 because of the number of banks in 
the sample that lacked analyst coverage. Despite the reduction in sample size, inferences 
remain unchanged with this specification, although the R?s declined in all models. 

As a second alternative specification of models (7) through (10), we computed standard 
deviations of the income metrics by deflating income by book value of equity. We then 
included these standard deviations of income in a book-value-deflated abnormal-earnings 
model (i.e., a market-to-book specification of models (7) through (10) in Table 5). Under 
this specification, inferences, significance levels, and R?s remain qualitatively similar to 
those reported in Table 5. 

As a third alternative, we used earnings in lieu of abnormal earnings. Inferences remain 
unchanged and the significance levels and R?s increased in all models. We did not tabulate 
these results because we believe that the originally computed abnormal earnings proxy is 
more theoretically consistent with the residual-income-valuation model. Finally, we further 
augmented the model by adding an interaction term consisting of book value times each 
of the income-volatility measures. The coefficients on the book-value/income-volatility- 
interaction terms are negative and highly significant. The direction and magnitude of the 
coefficients on the abnormal-earnings/income-volatility interactions are similar to those 
presented, but are less significant. 


V. CONCLUSIONS 
Using a sample of 202 commercial banks over 1996—2004, we analyze volatility in 
three alternate measures of income: reported net income, reported comprehensive income, 
and a constructed measure of full-fair-value income. We find that the volatility of full-fair- 
value income is significantly greater than the volatility of net income and comprehensive 


= The results in Table 5 indicate that incremental volatility in ЕРУ income negatively moderates the capitalization 
of abnormal earnings in bank share prices. We attribute these results to volatility in FFV income capturing 
incremental risk-relevant information but the findings could also be attributable to lower persistence in FFV 
income. Consistent with our risk explanation, the results we report in Table 4, Panel C provide strong evidence 
that incremental volatility in FFV income contains incremental risk-relevant information beyond volatility in net 
income and comprehensive income, and beyond banks' disclosed measures of market-based risks (i.e., repricing 
gap and derivatives exposure). Further, the results we report in Table 6 indicate that banks' costs of capital are 
increasing in FFV income volatility, consistent with our risk explanation. 
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income for nearly all of our sample banks. Interestingly, we find that average levels of 
these three income measures over these nine years are similar in magnitude, suggesting 
that fair-value adjustments in both comprehensive income and full-fair-value income zero 
out over our sample period. 

Consistent with bank representatives’ criticisms of comprehensive income during the 
deliberation of SFAS No. 130, we find that reported comprehensive income correlates neg- 
atively with unrecognized fair-value changes in financial instruments. In leveling their crit- 
icisms, these individuals claimed that comprehensive income would paint an unrealistically 
harsh picture of banks’ core operations and risk-management practices because banks use 
their total portfolios of financial instruments to manage risk, and other comprehensive 
income only captures fair-value changes for a small subset of financial instruments (i.e., 
available-for-sale securities). Contrary to this claim, we find that the partial-fair-value- 
income accounting and reporting currently mandated in SFAS No. 115 and 130, respec- 
tively, substantially understates the variability of banks’ full-fair-value income. Dur results 
suggest full-fair-value accounting by banks for all financial asset and liability positions 
reflects more completely the effects of banks’ exposure to changes in fair values of financial 
instruments. 

We also investigate the extent to which the volatilities of alternative-income metrics 
capture market-based risks, including four stock-market-based risk measures (1.8., market- 
model beta, standard deviation in stock returns, and short-term and long-term interest-rate 
betas) and two disclosed measures of banks’ exposure to market risks (i.e., total derivatives 
and interest repricing gap). We find volatility in net income is consistently pos:tively cor- 
related with all of those market-risk factors with the exceptions of market-model beta and 
repricing gap. Volatility in comprehensive income is only consistently positively correlated 
with standard deviation in returns and short-term interest-rate beta. We find that volatility 
in FFV income is significantly positively associated with the standard deviation in banks’ 
stock returns and derivatives exposure. We also find that the incremental volatility in FFV 
income is positively associated with standard deviation in returns, market model beta, and 
long-term interest-rate beta, beyond the effects of volatility in net income and comprehen- 
sive income, and after controlling for repricing gap and derivatives exposure. These results 
provide support for the conclusion that volatility in FFV income reflects the effects of risk 
factors that are not completely captured by volatility in net income or comprehensive in- 
come, or by banks’ disclosures of exposure to market risks. 

Across our share-price and cost of capital analyses, we find that ЕРУ-шсошг volatility 
significantly negatively moderates the capitalization of abnormal earnings in share prices, 
and is positively associated with implied required rates of return on bank equ:ty capital, 
consistent with the capital markets pricing FFV-income volatility as risk. We find that 
volatility in FFV income is negatively associated with abnormal-earnings-capitalization 
multiples and positively associated with required rates of return even after controlling for 
volatilities in net income and comprehensive income and after controlling for information 
in banks' market-risk disclosures regarding repricing gaps and derivatives exposures. 

Our findings imply that FFV-income volatility for banks captures important, value- 
relevant elements of risk that are priced by the capital markets. In addition, FFV-income 
volatility appears to reflect elements of risk that are not captured by volatility in net income 
or comprehensive income, or disclosed measures of market risks. We believe these findings 
are useful to the FASB and the IASB as they consider whether to develop accounting 
standards to recognize changes in fair values for all financial instruments in income. We 
believe the findings are also useful to capital-markets participants and researchers interested 
in explaining banks' share prices and risk. Further, we expect these results are useful to 
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bank regulators as they evaluate the role of fair value measurement in bank supervision and 
regulatory capital monitoring. 

We increase the power of our tests by concentrating our analysis on a single industry 
in which financial instruments are centrally important for operations and risk management. 
This enhanced statistical-conclusion validity comes at a cost, because our sample of banks 
may not be representative of the broad cross-section of U.S. and international companies 
and industries. 


APPENDIX 
Variable Descriptions and Calculation of Full-Fair-Value Income 
This appendix provides descriptions of the accounting variables in this study and 
sources of data used in their computation. 


Accounting Variables Used in the Analysis 

* NI = reported net income obtained from SNL DataSource. 

® CI = reported comprehensive income obtained from SNL DataSource. CI equals NI 
plus unrealized fair-value gains and losses on available-for-sale securities plus other 
comprehensive income items (i.e., foreign currency translation adjustments and pen- 
sion adjustments). 

® FFVI = full-fair-value income. FFVI equals CI plus fair-value adjustments for un- 
recognized gains and losses on financial instruments (FVA). 

* FVA - fair-value adjustments for unrecognized fair-value gains and losses on finan- 
cial instruments. We compute fair-value adjustments as the change in the excess of 
fair value over book value of financial instruments. We obtain fair values of recog- 
nized financial instruments from SNL DataSource. We obtain fair values of off- 
balance-sheet items including derivatives from bank holding company regulatory 
reports (Form Y9-C). We derive fair values of non-term deposits from bank holding 
company regulatory report data using the approach discussed more completely be- 
low. We adjust all fair-value adjustments net of tax effects using the top statutory 
U.S. Federal corporate income tax rate (35 percent). 

• Non-term deposit liabilities = Under current accounting standards, the fair value of 
non-term deposit liabilities may be assumed to be equal to the face amount plus 
accrued interest, therefore there are no existing disclosures of the fair values of non- 
term deposits. Under the proposed standard, the fair value of non-term deposits 
depends on the estimated timing and amounts of withdrawals, the level of interest 
rates on other liabilities with similar terms, the cost of servicing the deposits and 
the financial institution's own credit risk. Valuation techniques estimate the present 
value of the expected future cash flows by discounting the expected cash flows at a 
rate equal to that used by the market to price loans of similar risk and effective 
maturities. When the market rate is higher (lower) than the total cost of deposits, 
the fair value of the liabilities will be less (greater) than face value. 


For purposes of our analysis non-term deposits are valued using a static discounted 
cash flow approach used by bank regulators together with firm-specific deposit data obtained 
from bank holding company call reports. The discount equation is shown below: 


360 
PV Non-term deposits = 2 CF, df, (Al) 
t 
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where: 


РҮ = present value of non-term deposit balance; 
CF, = expected cash outflow resulting from that balance in month t; and 
df, = discount factor applicable to month 7. 


Cash flows are discounted by the zero-coupon treasury rate appropriate to each month 
plus a spread. The spread is a parameter that calibrates demand deposit prices to observed 
prices for deposit purchase/assumption transactions and is obtained from the U.S. Treasury 
Department website (http://www.treasury.ots.gov). 

Cash flows in the discount equation are given by the following: 


CF, = NIC(B,_,) + [1 + r)B, , — B] (A2) 
where: 


CF, = cash flow in month t; 

NIC = rate of noninterest cost of retail deposits; 
В, = demand deposit balance at end of month t; and 
г, = deposit rate in month 7. 


The deposit rate (7,) in any month is assumed to be a function of three factors: (1) the 
rate paid by the institution in the prior month, (2) projected market rates for “їе current 
month, and (3) the difference between the rates paid and market rates, which are assumed 
_ to gradually converge in equilibrium. Specifically: 


Yr, = а + CV. — т) + ЖК, К) + ек, — R3) + С, (A3) 
where: 


r, = deposit rate in month /; 

К, = implied forward short-term rates in month 7; 

С, = adjustment toward equilibrium rate = (то; — Е, |), ifr, , = E, , 
gr, — E), if r1 < Е, | ; and 

E, = equilibrium interest rate for demand deposits = a + bRt. 


The parameters (a-g) in the above equations are determined using historical data. For 
example, a and b are parameters obtained from regressing deposit rates on one-month 
LIBOR while c, d, and e are obtained from a regression of current rates on lagged changes 
in rates. Estimates for all parameters are obtained from the U.S. Treasury Department 
website (http:// www.treasury.ots.gov). 

Finally, balances in non-term deposit accounts are assumed to decline over time with 
the rate of decline influenced by the relation of the rate paid on the account to market rates 
(the ratio of 7,/R,). The decline is nonlinear and assumed to take the following Zorm: 


B, = B,_\(h  iarctan(k + jr/R) + Ir)! (A4) 
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where: 
В, = non-term deposit balance at end of month 5 
arctan = arctangent function; 
r, = interest rate offered on deposits in month ¢ by institution i; and 
К, = implied forward short-term rates in month г (annual percentage form). 


As with the other equations, all parameters in Equation (A4) (1-1) are based on aggre- 
gate historical balance migration data obtained from the U.S. Treasury Department. The 
short-term market rate series used is one-month LIBOR. Annual deposit servicing cost 
parameters were obtained from the Office of Thrift Supervision “Selected Asset and Lia- 
bility Price Tables" (http://www.ots.treas.gov). Because firm-specific transaction account 
interest costs were not available, the interest component of cost was estimated by matching 
the quintile rank of the financial institution based on total deposit cost to the mean interest 
rate for that quintile of transaction deposits from the Selected Asset and Liability Price 
Tables. 
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ABSTRACT: Prior research demonstrates that forecast optimism Is, in part, a conse- 
quence of analysts' cognitive reactions to the scenarios managers use to communicate 
future plans. In two experiments, we examine whether counter-explanation (explaining 
why managers’ plans could fail) reduces scenario-Induced optimism. We find that when 
compared to analysts not asked to generate counter-explanatlons, analysts who com- 
plete the relatively easy task of generating few counter-explanations make less opti- 
mistic forecasts, but analysts who complete the relatively difficult task of generating 
many counter-explanations do not. Results demonstrate the usefulness of a cognitive, 
theory-based mechanism for reducing forecast optimism and suggest a boundary con- 
dition for the use of that mechanism. 


Keywords: counter-explanation; availability; EPS; forecast optimism. 
Data Availability: Contact the authors. 


I. INTRODUCTION 
n a move applauded by Wall Street analysts, Coca-Cola Co. announced that it would 
Г longer provide quarterly earnings guidance because such short-run guidance “‘pre- 
vents a more meaningful focus on the strategic initiatives that a company is taking to 
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build its business and to succeed over the long run” (Wall Street Journal Опйке 2002). In 
connection with this shift away from providing quarterly earnings guidance, investor- 
relations and other executives are moving toward guiding investors' expectations for annual 
earnings (National Investor Relations Institute [NIRI] 2003; Zuckerman 2005). Managers 
eliminating short-run earnings guidance to promote a focus on long-run performance will 
likely increase their use of qualitative or narrative communications regarding their plans 
and future earnings expectations. Yet, managers’ use of detailed, causal narratives (1.е., 
scenarios) to communicate plans to improve future earnings facilitates analysts' ability to 
envision how and why these plans will improve future firm performance, resulting in op- 
timistic analysts' earnings forecasts (Sedor 2002). In this paper, we test whether generating 
counter-explanations (i.e., reasons why managers' plans could fail) mitigates cptimism in 
analysts’ forecasts. We also provide evidence regarding a boundary condition for the use 
of counter-explanation in this domain. 

It is likely that optimism in analysts' forecasts is costly to investors because investors 
do not appear to unravel analysts' forecast optimism completely (e.g., Dechow et al. 2000; 
Bradshaw et al. 2003). In fact, investors who use analysts' forecasts to estimate firm value 
could improve the quality of their valuation estimates by correcting for analysts’ optimism 
(Frankel and Lee 1998). Additionally, analysts appear to use their annual earnings forecasts 
to develop their recommendations (e.g., Bradshaw 2004), and the accuracy of analysts' 
forecasts is associated with the profitability of their contemporaneous recommendations 
(Loh and Mian 2005). Consequently, reducing optimism in analysts' earnings forecasts also 
could result in higher returns for investors relying on analysts' recommendatiors. 

Drawing on theory in psychology, we propose that asking analysts to generste counter- 
explanations will reduce forecast optimism. Our theory further suggests that (he ease or 
difficulty that analysts experience while generating counter-explanations will influence the 
effect of the counter-explanation mechanism on analysts' forecasts. Consistent with avail- 
ability, we expect that the ease or difficulty of generating counter-explanations will act as 
a heuristic cue and qualify the influence of counter-explanation content on analysts’ fore- 
casts.! Specifically, we hypothesize that optimism will be reduced when analysis generate 
counter-explanations with relative ease, but not when analysts generate counter-explanations 
with relative difficulty. 

In Experiment 1, we provide professional analysts with summarized financial infor- 
mation for a firm and a scenario describing managers' plans to improve future earnings. 
From this information, analysts in the control condition provide one-year- and two-year- 
ahead annual earnings forecasts. Analysts in the experimental conditions list reasons why 
managers' plans could fail (i.e., generate counter-explanations) prior to providing their fore- 
casts. We manipulate the availability of counter-explanations by asking analysts to generate 
either few (which is relatively easy) or many (which is relatively difficult) counter- 
explanations. We find that analysts who perform the relatively easy task of generating few 
counter-explanations make annual earnings forecasts that are lower than the forecasts made 
by analysts in the control condition. In contrast, analysts who perform the relatively difficult 
task of generating many counter-explanations make annual earnings forecasts that do not 
differ from the forecasts made by analysts in the control condition. We find this -esult even 





! Individuals employ the availability heuristic when they estimate probabilities or frequencies by the ease with 
which relevant information can be brought to mind (Tversky and Kahneman 1973). The term availability reflects 
the use of two types of mental operations that bring information to mind: direct recall and ease of generation 
(Hoch 1984). We focus our investigation on availability as the ease (or difficulty) of generation. 
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though analysts in the generate many condition generate more reasons why managers’ plans 
could fail than do analysts in the generate few condition. 

In Experiment 2, we provide additional evidence that the availability of counter- 
explanations, and not differences in counter-explanation content, explains Experiment 1 
results. In particular, in Experiment 2, we replicate Experiment 1 with M.B.A. student 
participants and add two experimental conditions in which all counter-explanations (re- 
gardless of number) are equally available to participants. Each participant in these new 
conditions reads a reproduction of the counter-explanations generated by another participant 
instead of generating his or her own. Specifically, each participant in the read few (read 
many) condition is presented with the set of counter-explanations created by a matched 
participant in the generate few (generate many) condition. This yoked design ensures that 
the quality and order of counter-explanations are identical between generate and read 
conditions. 

Consistent with Experiment 1 results, we find in Experiment 2 that participants asked 
to perform the relatively easy task of generating few counter-explanations make annual 
earnings forecasts that are lower than the forecasts made by participants in the control 
condition, but that participants asked to perform the relatively difficult task of generating 
many counter-explanations make annual earnings forecasts that do not differ from the fore- 
casts made by participants in the control condition. We also find that participants who read 
the counter-explanations generated by other participants, regardless of the number of 
counter-explanations read, make annual earnings forecasts that are lower than the forecasts 
made by participants in the control condition. If differences in counter-explanation content 
explained results, then forecasts made by read condition participants would mirror forecasts 
made by generate condition participants (Wanke et al. 1996). In contrast, forecasts made 
by participants who read either the few or many counter-explanations generated by matched 
participants are lower than the forecasts made by participants who neither generated 
nor read counter-explanations, supporting our theory that the availability of counter- 
explanations explains reported results. 

Our research makes three contributions. First, we identify and test counter-explanation 
as a mechanism to reduce optimism in analysts' forecasts. Prior research suggests that 
optimism is due, in part, to managers' use of scenarios to communicate their plans to 
improve future firm performance (Sedor 2002). Because managers are likely to continue 
using scenarios to communicate their plans, recipients of those communications bear the 
burden of taking measures to control their cognitive reactions to the scenarios. The counter- 
explanation mechanism is practical and could be implemented by analysts and others who 
wish to avoid optimism, but who rely on causal narratives that describe positive future 
outcomes when developing forecasts. 

Second, our results provide insight into potential causes of forecast optimism among 
analysts. In particular, results are consistent with analysts failing to consider spontaneously 
a small number of counter-explanations when contemplating scenarios describing managers' 
plans. To the extent that scenario-induced optimism is manifest in analysts' forecasts of 
annual earnings, failing to consider counter-explanations may be one factor contributing to 
the optimism. Finally, we demonstrate an important boundary condition for the use of 
counter-explanation as a debiasing mechanism. In both experiments, we find that the ability 
to generate counter-explanations with ease is critical to the effectiveness of this mechanism. 

We use an experimental methodology for our investigation because archival data neither 
provide information regarding the cognitive forecasting processes used by professional an- 
alysts nor a control condition against which to compare the effects of manipulating counter- 
explanation availability among analysts. Additionally, we hold constant investment banking 
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relationships, which enables us to examine how forecast optimism is affected by counter- 
explanation in the absence of economic incentives. A limitation of our experimental ap- 
proach is that we cannot measure absolute levels of forecast optimism, and thus the evidence 
is limited to relative levels of optimism across experimental conditions. 

The remainder of this paper is organized as follows. Section II analyzes relevant lit- 
erature and develops the first set of hypotheses. Section Ш describes the design, method, 
and results for Experiment 1. Section IV develops the second set of hypotheses and de- . 
Scribes the design, method, and results for Experiment 2. The final section summarizes and 
concludes. 


II. LITERATURE ANALYSIS AND HYPOTHESES 
Forecast Optimism and Scenarios 

Optimism in annual earnings forecasts is statistically and economically significant (e.g., 
Darrough and Russell 2002; Duru and Reeb 2002) and increases over the forecast horizon: 
analysts’ two-year-ahead forecasts are relatively more optimistic than their one-year-ahead 
forecasts (DeBondt and Thaler 1990; Frankel and Lee 1998; Bradshaw et al. 2003). Al- 
though optimism in quarterly earnings forecasts decreases as the earnings announcement 
date approaches (e.g., Brown 1997, 2001; Matsumoto 2002; Richardson et al. 2004), ana- 
lysts' forecasts of annual earnings are often used to make investment decisions and to 
estimate firm value.? 

Much, but not all, of the existing literature has focused on incentive-based explanations 
for analysts' forecast optimism.? Recent research, however, suggests that analysts are un- 
likely to issue intentionally optimistic forecasts (Eames and Glover 2003; Irvine 2004).* 
Consistent with forecast optimism being driven by something other than analysts' responses 
to economic incentives, market analysts (Darrough and Russell 2002), analysts employed 
by independent research firms (Jacob et al. 2003), and students (Affleck-Graves et al. 1990) 
also make optimistic earnings forecasts. 

We focus our investigation on a key aspect of analysts' information environment: the 
detailed, causal narratives (i.e., scenarios) that managers commonly use to communicate 
their plans to improve future firm performance (via conference calls, press releases, and 
other communications). Optimism in analysts' annual earnings forecasts derives in part from 
analysts' cognitive reactions to these scenarios (Sedor 2002). When analysts contemplate 
managers' plans in scenarios, they envision in concrete detail how managers will implement 
plans and, ultimately, how and why those plans will improve future firm performance (Sedor 


~ 


For example, empirical applications of the residual income model often use analysts’ forecasts as proxies for 
the market’s expectations of future earnings (e.g., Frankel and Lee 1998; Dechow et al. 1999; Lee et al. 1999; 
Liu and Thomas 2000; Sougiannis and Yaekura 2001; Ali et al. 2003). Bradshaw (2004) suggests that analysts" 
recommendations reflect a simple heuristic based on forecasts of annual earnings and long-term earnings growth. 
Ohlson and Juetmer-Nauroth (2005) also model firm value as a function of annual earnings and earnings growth 
forecasts. 

3 Analysts may provide optimistic forecasts in order to maintain or gain access to information (Francis and 
Philbrick 1993; Krishnan and Sivaramakrishnan 1998; Lim 2001), to motivate trades (Kim and Lustgarten 1998), 
and to obtain investment banking business (Dugar and Nathan 1995; Hunton and McEwen 1997; Lin and 
MeNichols 1998; Dechow et al. 2000; Bradshaw et al. 2003). Other explanations include justifying favorable 
recommendations (Eames et al. 2002), failing to adjust for earnings management (Abarbanell and Lehavy 2003), 
and choosing to report only on stocks that are viewed favorably (McNichols and O’Brien 1997). 

* This is consistent with research suggesting that analysts’ reputations depend on accurate forecasts and recom- 

mendations (Michaely and Womack 1999; Loh and Mian 2005). Additionaliy, greater forecast accuracy decreases 

the likelihood of termination (Mikhail et al. 1999) and increases the likelihood of moving from a lower to a 

higher status brokerage firm (Hong and Kubik 2003). 
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2002). Envisioning ways in which managers’ plans will improve future firm performance 
increases the availability of those plans and the associated positive outcomes to analysts 
(1.е., the plans and resultant performance improvements are easy to bring to mind). Indi- 
viduals use the ease of bringing information to mind as a heuristic cue when making 
judgments about probabilities or frequencies (Туегѕку and Kahneman 1973; Kahneman and 
Tversky 1982; Kahneman and Lovallo 1993). Consequently, analysts will infer from the 
availability of managers' plans and the associated positive outcomes that future performance 
improvements are likely, and optimism in analysts' forecasts will result. 


Scenarios and Counter-Explanation 

Effective debiasing of scenario-induced forecast optimism requires developing a mech- 
anism that addresses the optimism's cognitive source (Arkes 1991; Kennedy 1993, 1995; 
Bonner et al. 1996). For instance, the mechanism could increase the availability of causal 
chains of events leading to alternative (less positive) outcomes. One such mechanism is 
counter-explanation. 

Prior research conducted in the context of analytical review tasks demonstrates that 
counter-explanation reduces auditors' likelihood assessments that unusual account balance 
fluctuations were driven by hypothesized causes. Heiman (1990) showed that both gener- 
ating and reading counter-explanations reduced auditors' likelihood assessments for an 
experimenter-suggested cause. Koonce (1992) had a group of auditors generate counter- 
explanations to a manager-suggested cause for an unusual account balance fluctuation. She 
found that the auditors who generated counter-explanations rated the manager's explanation 
as less likely than did auditors who did not generate counter-explanations. Similarly, we 
expect that generating counter-explanations to managers' plans to improve future firm per- 
formance (that is, generating reasons why managers’ plans could fail) will reduce analysts’ 
beliefs that the managers' plans are likely to be successful and will mitigate analysts' fore- 
cast optimism.* 

Heiman (1990) found that auditors who considered five counter-explanations made 
lower likelihood assessments than did auditors who considered two counter-explanations. 
This finding suggests that the larger the number of counter-explanations considered by 
analysts, the greater the anticipated effect of the counter-explanations on forecasts. How- 
ever, we draw on more recent psychological theories to develop hypotheses regarding a 
boundary condition for the use of counter-explanation to mitigate analysts' forecast 
optimism. 


Boundary Condition: The Experienced Ease or Difficulty of 
Generating Counter-Explanations 

We apply the theoretical framework of availability to understand the expected boundary 
condition for the use of counter-explanation as a debiasing task. There are two sources of 
information associated with items, such as counter-explanations, that are relevant to judg- 
ments and decisions: the information content of the items and the experienced ease or 
difficulty with which those items are brought to mind (ie., availability) (Hoch 1984; 
Schwarz et al. 1991; Schwarz 1998). The availability of information often affects the influ- 
ence of information content on judgment (Sanna et al. 2002; Wanke et al. 1997; Schwarz 


* To the extent that analysts’ professional responsibilities include skepticism, such skepticism could encourage 
spontaneous counter-explanation and would bias against our ability to detect an effect when analysts are asked 
to generate counter-explanations (Heiman 1990; Koonce 1992). 
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et al. 1991). This availability effect is most likely when individuals consider themselves to 
be knowledgeable in a domain because they then believe that the ease or difficulty they 
experience when bringing information to mind is informative (Schwarz 1998). 

Consistent with availability affecting the influence of information content on judgment, 
Sanna et al. (2002) found that asking participants to generate two thoughts about how a 
war could have turned out differently (which participants found easy to do) attenuated 
hindsight (i.e., reduced their exaggerated assessments of the inevitability of past events). 
However, asking participants to generate ten thoughts (which participants found difficult to 
do) exacerbated hindsight effects, despite the fact that the latter group identified significantly 
more thoughts about how the war's outcome could have occurred differently than did the 
former group. 

Similarly, we expect that the availability of counter-explanations (1.е., the ease or dif- 
ficulty of their generation) will influence whether counter-explanation mitigates scenario- 
induced optimism. If analysts generate counter-explanations with relative ease, then we 
expect that analysts will use the availability of the counter-explanations as a heuristic cue 
suggesting that future performance improvements are unlikely (because it is easy to bring 
to mind ways that managers' plans could fail to improve future performance). As a result, 
forecast optimism will be reduced (Hypothesis 1). On the other hand, if analysts generate 
counter-explanations with relative difficulty, then we expect that analysts will use the lack 
of available counter-explanations as a heuristic cue suggesting that future performance 
improvements are likely (because it is difficult to bring to mind ways that manégers’ plans 
could fail to improve future performance). In this case, forecast optimism will not be 
reduced (Hypothesis 2) and may even be increased. However, with additional counter- 
explanation content, it is difficult to predict the net effect of requiring the generation of 
many counter-explanations. Thus, in recognition of the competing influences of counter- 
explanation availability and additional counter-explanation content, our second hypothesis 
is not directional. We test the following predictions: 


H1: Analysts who are asked to generate few counter-explanations of managers' plans 
to improve future earnings will make earnings forecasts that are less optimistic 
than the earnings forecasts made by analysts who are not asked to generete counter- 
explanations. 


H2: Analysts who are asked to generate many counter-explanations of managers' plans 
to improve future earnings will make earnings forecasts that do not differ from the 
earnings forecasts made by analysts who are not asked to generate counter- 
explanations. 


П. EXPERIMENT 1 
In Experiment 1, we examine whether generating counter-explanations can reduce 
scenario-induced optimism in analysts’ earnings forecasts and whether the availability of 
counter-explanations influences the effectiveness of this mechanism. We use a 1 X 3 





Ф 


It is unclear, ex ante, whether the effects of counter-explanation observed by Sanna еї al. (2002) -п the context 
of debiasing hindsight (exaggerated retrospective likelihood assessments) and Heiman (1990) and Koonce (1992) 
in the context of reducing reliance on initial hypotheses in diagnosis (also exaggerated retrospective likelihood 
assessments) will be observed in the context of debiasing forecast optimism. Thus, we test whether counter- 
explanation tasks will influence prospective judgments in a way that mirrors their effect on retrospective 
judgments. 
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between-subjects design with the number of counter-explanations participants are asked to 
generate (none, few, many) as the independent variable. For presentation purposes, we 
calculate our dependent variable by subtracting current earnings per share (EPS) as provided 
to participants from participants’ annual EPS forecasts for one year and two years ahead. 


Design and Method 

Participants are 59 financial analysts recruited via telephone calls placed by the authors 
using Nelson’s Directory of Investment Research (2003). We awarded a prize of $500 to 
one participant via random drawing as a token of our appreciation. Participants reported a 
mean (median) of 11 (8) years of experience. Ninety-three percent of participants identified 
themselves as sell-side analysts, 36 percent as certiüed financial analysts (CFAs), and 37 
percent as being named by Institutional Investor as All-Star Analysts. Participants reported 
8 mean (median) of 1.4 (1.0) industry specializations, with the most common being enter- 
tainment (19 percent), telecommunications (19 percent), health (10 percent), and transpor- 
tation (10 percent). We used a predetermined randomization plan to assign participants to 
experimental conditions as they were recruited. We provided each participant with an elec- 
tronic link to the experimental materials for his or her assigned condition, and participants 
completed the study online.* 


Case Materials and Procedures 

Case materials were adapted from the loss-firm/scenario-structure condition used in 
Sedor (2002). We focus on a loss firm for several reasons. First, the effect of scenario- 
induced optimism on analysts' earnings forecasts is larger for firms reporting losses than it 
is for firms reporting profits (Sedor 2002). Correspondingly, the most optimistic annual 
earnings forecasts issued by analysts are associated with firms that report losses in the prior 
year (Klein 1990; Ali et. al. 1992; Dowen 1996) or that have weak prior earnings perform- 
ance (Abarbanell and Bernard 1992; Elgers and Lo 1994). Finally, firms reporting consec- 
utive losses are likely to discontinue earnings guidance (Graham et al. 2005). Thus, we 
believe that managers of loss firms are more likely to use qualitative or narrative commu- 
nications when presenting their plans and future expectations. 

To avoid the influence of economic incentives on forecasts, we instructed each partic- 
ipant to assume that he or she was initiating coverage of a company (Villa Fiore Winery) 


? Including either the author who recruited the analyst, years of experience, CFA designation, or All-Star Analyst 
rating in the models when testing our hypotheses does not change reported inferences. Additionally, inferences 
remain unchanged if data from buy-side analysts or the analyst who indicated a beverage specialization are 
excluded. 

* Although approximately equal numbers of participants were assigned to each condition, and approximately equal 

numbers of participants in each condition accessed the study, participants in the generate many condition were 

somewhat less likely to complete both sets of forecasts than were participants in the other two conditions (x? 

= 5.12, p = 0.08), and thus the cell size for the generate many condition is the smallest. However, of the 16 

analysts who dropped out before completing the study, 12 did so before making initial forecasts. These partic- 

ipants could not have seen the counter-explanation manipulation because it took place after initial forecasts were 
elicited. Fisher's Exact Test also reveals that the timing of dropouts does not differ by experimental condition 

(p = 0.73). Thus, it is unlikely that the manipulation caused differences in drop out rates across experimental 

conditions. 

Sedor's (2002) materials were constructed from Forms 10-K filed by companies in the wine industry. Advantages 

of these materials include (1) at (and before) the time the experiment was conducted, the press had not reported 

any noteworthy news events for the wine industry (e.g., excessive supplies, recalls); (2) the wine industry is not 
followed by a large number of analysts (minimizing any potenzial advantage of industry-specific expertise); and 

(3) these materials induced scenario thinking among financial analysts in Sedor (2002). We updated all materials 

for recent industry events, but maintained their original structure. 


м 
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that did not have (and did not contemplate) an investment banking relationship with the 
participant’s firm. Participants began by reading three years of summarized financial infor- 
mation (including net losses and negative EPS for the current and prior two years), a brief 
description of the company, and product information. Participants then provided initial 
forecasts of annual EPS for one year and two years ahead.'° Participants had access to the 
summarized financial information while making their initial forecasts. Next, participants 
read an excerpt from a conference call held after the company’s annual earnings announce- 
ment. The conference-call excerpt included the CEO’s discussion of the industry, the com- 
pany, and a description of managers’ plans to improve future earnings in scenari» structure. 
Participants then generated counter-explanations (if applicable) and made final EPS fore- 
casts. Electronic links provided access to the summarized financial information and to the 
conference-call excerpt (but not the counter-explanations, if applicable) while participants 
were making their final forecasts. Finally, participants completed a post-experimental 
questionnaire. 


Independent Variable 

We manipulated the availability of counter-explanations (i.e., the ease or difficulty of 
their generation) by asking participants to generate varying numbers of counter-explanations 
after they read the conference-call excerpt, but before thev made final forecasts. Specifically, 
we provided participants in the generate few (generate many) condition with the following 
instruction: 


Imagine that firm policy requires you to document (as part of your analysis) reasons 
why managers’ plans COULD FAIL. Below, please provide TWO (TWELVE) reasons 
why the plans presented by Villa Fiore's managers during the conference call COULD 
FAIL to increase sales and profitability as expected. 


The online instrument contained either two or 12 blank lines for participants’ counter- 
explanations, depending on experimental condition. The instrument was programmed 
such that participants were required to input the requested number of counter-explanations 
before continuing. Control condition participants were not explicitly asked to generate 
counter-explanations; instead they made final EPS forecasts immediately after reading the 
conference-call excerpt. 


Dependent Measures 

Participants provided initial and final forecasts of annual EPS for one year and two 
years ahead. For presentation purposes, we subtract current (year 0) EPS of —S0.64 from 
participants’ EPS forecasts to arrive at participants’ forecast increases in EPS. The depen- 
dent measures are participants’ final forecast increases in EPS for both one year (year 1) 
and two years ahead (year 2). We use participants’ initial forecast increases in EPS as 
covariates in the analyses. Table 1, Panel A reports descriptive statistics for Experiment 1, 
including means and least squares (adjusted) means for final forecast increases in EPS for 
years 1 and 2. 


10 Participants also forecasted five-year long-term growth rates (LTG). The pattern of LTG matches that of EPS 
forecasts, but the manipulations do not explain significant variation in LTG (smallest p = 0.08). We believe this 
is due to the inherent difficulty in forecasting long-term growth rates for firms with negative earnings and thus, 
LTG is not discussed further. 
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TABLE 1 
Experiment 1: Professional Analysts’ EPS Forecasts 


Panel A: Descriptive Statistics: Mean [standard deviation] 


Year 1 Year 2 
ANCOVA ANCOVA 
Adjusted Adjusted 
Initial Final Final Initial Final Final 


Condition п in EPS in EPS in EPS* in EPS in EPS in EPS* 
Control 17 0.23 0.30 0.29 0.57 0.67 0.62 
[0.18] [0.22] [0.35] [0.31] 
Generate 24 0.21 0.16 0.17 0.45 0.44 0.46 
Few [0.15] [0.20] [0.24] [0.31] 
Generate 15 0.23 0.25 0.25 0.46 0.52 0.54 
Many [0.18] [0.24] [0.27] [0.29] 


Panel B: Planned Contrast Tests of Hypotheses 
Year 1 Year 1 Year 2 Year 2 


Hypothesis Contrast t-statistic p-value t-statistic p-value 

H1: Lower relative optimism for generate few versus 2.09 0.02 1.92 0.03 
control 

H2: No difference in relative optimism for generate 0.68 0.50 0.95 0.35 


many versus control 


* The final forecast increase in EPS is adjusted statistically for the initial forecast increase in EPS by including 
this as a covariate in the ANCOVA model. The adjusted means are reported here. 
Panel A provides descriptive statistics by experimental condition. "Year г final forecast increase in EPS” is our 
dependent variable and is defined as the final (post-manipulation} forecast for year г (t = 1 or 2) less the year 0 
EPS of —$0.64 as provided to participants. In our analyses, we adjust for the influence of participants’ initial 
(pre-manipulation) forecasts using analysis of covariance (ANCOVA) and thus, we report the ANCOVA least 
squares adjusted means in Panel A. Participants made initial EPS forecasts after reading summarized financial 
information and made final EPS forecasts after reading a conference-call excerpt (which presented information 
about managers’ plans to improve future earnings in scenario structure) and generating counter-explanations (if 
applicable). Participants in the generate few (generate many) condition were asked to list 2 (12) reasons why 
managers’ plans could fail before making final forecasts. Participants in the control condition were not asked to 
generate counter-explanations before making final forecasts. 
Panel B provides tests of hypotheses comparing the forecast increases in EPS of participants in experimental 
conditions to those of participants not asked to generate counter-explanations (i.e., participants in the control 
condition) to examine whether the availability of counter-explanations (i.e., the ease or difficulty experienced 
during the counter-explanation task) affects the relative amount of scenario-induced optimism. If the forecast 
increases in EPS in experimental conditions are less than the forecast increases in EPS in the control condition, 
then experimental condition participants are relatively less optimistic. P-values are one-tailed for the directional 
prediction in H1 and two-tailed for H2. | 


Results 
Preliminary Analyses and Manipulation Check 

We exclude three participants’ forecast increases in EPS from the analyses because they 
differed from sample means by more than four standard deviations, and diagnostic proce- 
dures indicated that they are influential outliers. Reported analyses use the remaining 56 
observations. 
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We use forecasts made by control condition participants as our baseline for measuring 
changes in scenario-induced optimism. We expect forecasts made by participants who con- 
template managers’ plans to improve future firm performance in scenarios to exhibit opti- 
mism (as in Sedor [2002]). Consistent with this, control condition participants gave final 
forecast increases in EPS after reading the conference-call excerpt that exceeded their initial 
forecast increases in EPS by $0.07 for year 1 (t,, = 1.39, р = 0.09) and by $0.10 (tis 
= 2.01, p = 0.03) for year 2.!! 

In a post-experimental question, we asked participants who generated counter- 
explanations to rate how difficult it was for them to do so using a scale ranging from 1 
(not at all difficult) to 9 (extremely difficult). Consistent with a successful manipulation, 
the difficulty ratings provided by participants asked to generate 12 counter-explanations 
were significantly greater than the difficulty ratings provided by participants asked to gen- 
erate two counter-explanations (means [standard deviations] = 3.67 [2.16] and 1.88 [1.30] 
for the generate many and generate few conditions, respectively; t4, = 3.25, р < 0.01). 


Tests of Hypotheses 

Table 1, Panel B reports results of the planned contrasts used to test our hypotheses. 
Participants asked to generate few counter-explanations gave significantly less optimistic 
forecast increases in EPS than did control condition participants (+; = 2.09, p = 0.02 for 
year 1 and t44 = 1.92, р = 0.03 for year 2), supporting H1. Participants asked to generate 
many counter-explanations gave forecast increases in EPS that did not differ from those of 
control condition participants (t,, = 0.68, р = 0.50 for year 1 and t,, = 0.95, р = 0.35 for 
year 2), supporting Н2.13 

Theory predicts that the availability of counter-explanations (i.e., the ease or difficulty 
of their generation) will influence the effectiveness of the counter-explanation task, rather 
than the number of counter-explanations generated per se. Thus, we repeat the main anal- 
yses using a median split of the difficulty of generation measure (manipulation check) as 
the independent variable.^ Participants reporting that it was relatively easy to generate 
counter-explanations gave marginally less optimistic forecast increases in EPS than did 
control condition participants for year 1 (adjusted means — 0.29 for control and 0.20 for 
lower difficulty; tj, = 1.45, p = 0.08) and significantly less optimistic forecas: increases 
in EPS for year 2 (adjusted means = 0.62 for control and 0.41 for lower difficulty; tj, 
— 2.61, p « 0.01), providing partial support for H1. Participants reporting higher difficulty 
on the counter-explanation task gave forecast increases in EPS that did not differ “rom those 


11 Control condition participants’ unadjusted mean final forecast increases in EPS for years 1 and 2 ere $0.30 and 
$0.67, respectively. Despite the changes in analysts’ environments over time (such as penalties Zor excessive 
optimism [Smith et al. 20031), forecasts of control condition participants are consistent with unadiusted means 
reported for Sedor's (2002) loss-firm/scenario-structure condition participants ($0.32 and $0.72, fo: years 1 and 
2, respectively). This suggests that control condition participants’ forecasts exhibit optimism, in particular for 
year 2 (the year for which Sedor found significant effects). 

1? These difficulty ratings differ significantly between experimental conditions, but both means are at the low end 
of the scale. This may be the result of participants' ability to access the financial information and the conference- 
call excerpt (via electronic links) while generating counter-explanations. To the extent that analysts in both 
conditions easily generated the requested number of counter-explanations, this would bias against finding sig- 
nificant differences in analysts' forecasts across experimental conditions. 

13 Inferences are unchanged when we exclude initial forecast increases in EPS from the model. 

14 We perform a median split instead of a mediation analysis because we lack difficulty ratings for participants in 
the control condition. Control condition participants were not asked to generate counter-explanations and so 
were not asked to assess the difficulty of doing so. The modal response to the difficulty item was | (on a nine- 
point scale), the median was 2, and we classified responses of less or equal to the median as “lower” difficulty. 
With the responses clustered at the bottom of the scale, there is limited variability in this measure, which biases 
against finding results consistent with our hypotheses and leads our cell sizes to be unbalanced. 
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of control condition participants for either year (adjusted means = 0.29 and 0.19 for control 
and higher difficulty, respectively, t = 1.54, р = 0.13 for year 1; and adjusted means 
= 0.62 and 0.63 for control and higher difficulty, respectively, t < 0.01, р = 0.94 for 
year 2), supporting H2. 


Tests of Alternative Explanations 

We perform supplemental tests to rule out potential alternative explanations for the 
results. First, to ensure that the finding of no difference between the control and generate 
many conditions was not due to small sample sizes, we estimate contrasts for 1,000 boot- 
strap samples and determine the resulting p-values using the percentile method. Each 
bootstrap sample was generated by sampling (randomly with replacement) 15 observations 
within each cell of the design (the smallest of the original cell sizes). For both one-year- 
and two-year-ahead forecast increases in EPS, inferences are identical to those reported for 
our main tests of H1 and H2. 

Second, we confirmed using analysis of variance that there are no significant relation- 
ships between experimental condition and perceptions about task realism, perceptions about 
the extent to which generating reasons why managers' plans could fail is a routine part of 
the forecasting process, beliefs about managerial competence, beliefs about managerial 
honesty, or perceptions regarding the importance of providing favorable or accurate fore- 
casts (p-values > 0.27 for all omnibus F-tests). Therefore, differences in these measures 
do not explain reported results. 

Third, although the ineffectiveness of counter-explanation in the generate many con- 
dition is consistent with H2, this result could also be consistent with analysts asked to 
generate 12 counter-explanations producing lower quality counter-explanations than ana- 
lysts asked to generate two counter-explanations. To address this possibility, we analyze 
the content of participants' counter-explanations. We asked two individuals with significant 
accounting and valuation consulting experience to code independently the 226 counter- 
explanations provided by participants into content categories (e.g., cannibalization of pre- 
mium wine sales; changing consumer preferences). One individual identified 20 unique 
content categories, and both individuals coded participants' counter-explanations into those 
categories. Cohen's Kappa is 0.76, indicating high to very high initial agreement (Stokes 
et al. 1995). The coders met to reconcile differences. We use the reconciled codings for all 
analyses. 

We calculate the percentage frequency with which counter-explanations from each cat- 
egory were listed by all participants and use this frequency measure as a proxy for counter- 
explanation quality. For instance, if a category occurred 20 times, then we assign each 
counter-explanation from that category the percentage frequency of 20/226 — 8.85 percent. 
Next, for each participant, we calculate the mean percentage frequency for the counter- 
explanations he or she generated (ie., averaging percentage frequencies for the two 
counter-explanations generated by participants in the generate few condition and for the 
12 counter-explanations generated by participants in the generate many condition). Means 
(standard deviations) for participants in the generate few and generate many conditions are 
7.16 (2.67) and 6.99 (1.16), respectively. There are no differences in this measure between 
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experimental conditions (t,, = 0.28, p = 0.78, two-tailed) and thus, differences in counter- 
explanation quality are unlikely to explain our results.!* 

This test is a conservative one, in that analysts in the generate many condition produced 
six times as many counter-explanations as did analysts in the generate few condition. Thus, 
the total body of evidence considered by analysts in the former condition would likely be 
larger than the total body of evidence considered by analysts in the latter condition regard- 
less of its average quality. Indeed, the counter-explanations provided by participants in the | 
generate many condition covered a mean of 8.9 content categories (ranging from five to 
12), while the counter-explanations provided by participants in the generate few condition 
covered a mean of 2.0 content categories.'é As a result, the counter-explanation content 
considered by analysts in the generate many condition exceeded that considered by analysts 
in the generate few condition. 


IV. EXPERIMENT 2 

In Experiment 2, we replicate Experiment 1 and provide additional corroborating evi- 
dence that Experiment 1 results are caused by differences in the availability of counter- 
explanations (i.e., the ease or difficulty of their generation) and not by potential differences 
in counter-explanation content. We designed Experiment 2 to disentangle the content of the 
counter-explanations from their availability. Specifically, we added two new experimental 
conditions that mirror the content of the counter-explanations from the generate conditions 
while ensuring that all counter-explanations (regardless of number) are equally available to 
participants (Wanke et al. 1996). In particular, we asked participants in the new experimental 
conditions to read the counter-explanations generated by other participants (in identical 
form and order) instead of generating counter-explanations themselves. 

If Experiment 1 results are due to differences in the content of the counter-explanations, 
then those same results should be observed in the new experimental conditions: reading 
few counter-explanations reduces forecast optimism, whereas reading тапу counter- 
explanations does not. In contrast, if Experiment 1 results are due to differences in the 
availability of the counter-explanations (i.e., the ease or difficulty of their generation), then 
those results should not be mirrored in the new read conditions. The read task ensures that 
counter-explanations are easily available to these participants, regardless of whether their 
matched participants generated few or many counter-explanations. Thus, we expect that 
reading few and reading many counter-explanations will reduce forecast optimism and make 
the following unique predictions for Experiment 2: 


15 Similarly, order effects could influence the results if participants in the generate many condition did not consider 
all of their counter-explanations when making their EPS forecasts, but instead, focused on only the first or last 
few counter-explanations. However, we find no significant differences in the average frequencies for the two 
counter-explanations generated by participants in the generate few condition when compared to either the first 
two or the last two counter-explanations generated by participants in the generate many condition (із, = —0.91, 
р = 0.37 and і = 1.21, p = 0.23, two-tailed, for the first two and last two counter-explanations generated, 
respectively). Therefore, differences in counter-explanation order are unlikely to be driving results. 

1$ Participants in the generate many condition provided similar counter-explanations to those provided by partic- 
ipants in the generate few condition, along with additional items. Although participants in the generate few 
(generate many) condition used a total of 14 (12) of the 20 content categories in generating their two (first two) 
counter-explanations, seven of these categories overlap between conditions and account for 74 percent of the 
first two counter-explanations generated. This percentage does not vary significantly between experimental con- 
ditions, with a mean 1.38 (1.53) of the two (first two) counter-explanations provided in the generate few (generate 
many) conditions coming from these seven categories (t,, = 0.75, p = 0.46). The counter-explanations provided 
by participants in the generate-many condition ultimately covered all 20 content categories. 
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H3: Analysts who are asked to read few counter-explanations of managers’ plans to 
improve future earnings will make earnings forecasts that are less optimistic than 
the earnings forecasts made by analysts who are not asked to generate or read 
counter-explanations. 


НА: Analysts who are asked to read many counter-explanations of managers’ plans to 
improve future earnings will make earnings forecasts that are less optimistic than 
the earnings forecasts made by analysts who are not asked to generate or read 
counter-explanations. 


Design and Method 

In Experiment 2, we further explore whether the availability of counter-explanations 
influences whether a counter-explanation task mitigates scenario-induced optimism (i.e., we 
replicate Experiment 1, H1 and H2), and we examine whether reading another participant's 
counter-explanations reduces optimism (1.е., we test НЗ and H4). We manipulate the in- 
dependent variables in a (2 X 2) + 1 (control) between-subjects experiment with the task 
(generate or read counter-explanations) and the number of counter-explanations (few or 
many) as independent variables. Control condition participants neither generate nor read 
counter-explanations. 


Participants | 

Participants are 130 second-year students enrolled in a managerial accounting class in 
a BusinessWeek 'Top 25 M.B.A. program. Participants were recruited via in-class announce- 
ments and assigned to experimental conditions using a predetermined randomization plan. 
We made a $1,000 contribution to the M.B.A. class gift as a token of our appreciation. 
Approximately 89 percent of participants report having previous personal investment ex- 
perience. Participants report having taken an average of four university-level accounting 
classes and three university-level finance classes." M.B.A. students are appropriate partic- 
ipants for this experiment because the effects of availability on judgment are not expected 
to depend on characteristics specific to analysts (see Peecher and Solomon 2001; Libby et 
al. 2002). Additionally, we replicate Experiment 1, H1 and H2 as originally demonstrated 
with analysts before providing additional evidence that the availability of counter- 
explanations explains results (i.e., testing H3 and H4). 


Case Materials and Procedures 

The case materials are similar to those used in Experiment 1. One exception is that the 
net losses and EPS for the current and prior two years are smaller (e.g., current (year 0) 
EPS is —$0.08 in Experiment 2, instead of the —$0.64 reported in Experiment 1). We 
reduce the level of the losses reported in Experiment 2 to increase our ability to generalize 
the results.? Another difference from Experiment 1 is the independent variable manipula- 
tions discussed below. The experimental procedures are otherwise identical to those used 


" One participant provided inappropriate responses to the forecast question (providing percentages instead of dollar 
amounts) and one participant failed to answer post-task questions. We eliminated these two observations. Eight 
M.B.A. students who were not enrolled in the managerial accounting class (because they had previous course- 
work and were taking a shortened program) but were in the same graduating class asked to participate. They 
are included in the total of 130. All reported inferences are identical if their data are omitted. Inferences are 
also unchanged if data from participants with no prior investment experience are eliminated. 

18 Although this change increases our ability to generalize the results, it may limit participants’ forecast optimism. 
To the extent that scenaric-induced optimism is greater for loss firms (Sedor 2002), this change in materials 
would limit our ability to replicate Experiment 1 results and bias against finding support for the hypotheses. 
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in Experiment 1: each participant was provided with an electronic link to his or Eer assigned 
experimental condition and participants completed the study online. 


Independent Variables 

We asked participants in experimental conditions either to generate counter- 
explanations or to read the counter-explanations generated by other participants after they 
read the conference-call excerpt, but before they made their final forecasts. Control con- 
dition participants were not asked to generate or read counter-explanations and made final 
forecasts immediately after reading the conference-call excerpt. Specifically, participants in 
the generate few and the generate many conditions received the following additional 
instruction: 

Before moving on to the next section, please list TWO (TEN) thoughts about how 

future financial performance might not be as positive as Villa Fiore ехрес:5. In other 

words, how might Villa Fiore’s plans to increase sales and profitability fail? 


The online materials included either two or ten blank lines for participants’ counter- 
explanations, depending on experimental condition. To address the possibility that Exper- 
iment 1 results depended on participants struggling to generate the full number of 12 
counter-explanations requested before proceeding, we reduced the number of counter- 
explanations requested in the generate many condition in Experiment 2 to ten, and the 
instrument was programmed such that participants were required to input a minimum of 
two counter-explanations before continuing. Allowing participants in the generate many 
condition to continue without generating all ten counter-explanations could reduce the ex- 
perienced difficulty of the task, thus reducing the strength of our manipulation end biasing 
against finding support for the hypotheses. 

Participants in the read few (read many) conditions received the following additional 
instruction: 


Before moving on to the next section, please review the thoughts listed by another 
analyst about how future financial performance might not be as positive as Villa Fiore 
expects. In other words, this analyst considered how Villa Fiore’s plans to increase 
sales and profitability might fail. 


Each participant in the read conditions was provided with the set of counter-explanations 
obtained from a matched participant in the generate conditions. In other words, each read 
few (read many) participant had a unique set of experimental materials that reproduced, in 
the same form and order, the counter-explanations created by a matched generate few 
(generate many) participant. This yoked design ensured that the quality and order of 
counter-explanations were identical between the generate and read few conditions as well 
as between the generate and read many conditions. However, the matching procedure ne- 
cessitated collecting data in stages. Our initial invitations to participate were distributed to 
only a portion of the participants pre-assigned to the control and generate conditions. Sub- 
sequent invitations included participants pre-assigned to all five conditions. Cne conse- 
quence of collecting data in stages is that we were unable to ensure equal cell sizes or to 
match every participant in the generate conditions with a participant in the corresponding 
read condition. Thus, we exclude from analysis data provided by 13 generate condition 
participants for whom we did not have matched read condition participants. 


Dependent Measures 
Participants provided initial and final forecasts of annual EPS for one year and two 
years ahead. Consistent with Experiment 1, we subtract current (year 0) EPS of —$0.08 


The Accounting Review, March 2006 


Using Counter-Explanation to Limit Analysts’ Forecast Optimism 391 


from participants’ EPS forecasts to arrive at participants’ forecast increases in EPS. The 
dependent measures are participants’ final forecast increases in EPS for both one year (year 
1) and two years ahead (year 2). We use participants’ initial forecast increases in EPS as 
covariates in the analyses. Table 2, Panel A reports descriptive statistics for Experiment 2, 
including means and least squares (adjusted) means for final forecast increases in EPS for 
years 1 and 2. 


Results 
Preliminary Analyses and Manipulation Check 

We exclude three participants’ forecast increases in EPS from the analyses because they 
differed from sample means by more than four standard deviations, and diagnostic proce- 
dures indicated that they are influential outliers. We also exclude the data provided by the- 
three participants matched to these participants to preserve the yoked experimental design. 
Reported analyses use the remaining 111 observations. 

Participants in the generate many condition listed significantly more counter- 
explanations (i.e., reasons why managers’ plans might fail) than did participants in the 
generate few condition (means = 7.09 and 2.00 for the generate many and few conditions, 
respectively; Їл, = 8.21, p < 0.01). Participants asked to generate ten counter-explanations 
also assigned marginally higher difficulty ratings to the task than did those participants 
asked to generate two counter-explanations (means [standard deviations] = 4.72 [2.10] and 
3.85 [1.95] for the generate many and generate few conditions, respectively; ty = 1.40, p 
= 0.08). 


Tests of Hypotheses 

Table 2, Panel B reports results of the planned contrasts used to test our hypotheses. 
Consistent with H1, participants asked to generate few counter-explanations gave marginally 
less optimistic forecast increases in EPS than did control condition participants for year 1 
(tios = 1.55, р = 0.06) and gave significantly less optimistic forecast increases in EPS than 
did control condition participants for year 2 (tios = 2.01, p = 0.02). Consistent with H2, 
participants asked to generate many counter-explanations gave forecast increases in EPS 
that did not differ from those of control condition participants (tios = 0.62, р = 0.54 and 
tios = 1.05, р = 0.30 for year 1 and year 2, respectively).'? 

We repeat the main analyses using a median split of the difficulty of generation measure 
(manipulation check) as the independent variable to corroborate the results. Participants 
reporting lower difficulty (i.e., 4 or less on a nine-point scale) gave significantly less op- 
timistic forecast increases in EPS than did control condition participants for both year 1 
and year 2 (adjusted means = 0.16 and 0.10 for control and lower difficulty, t. = 2.02, p 
= 0.02; adjusted means for year 2 = 0.26 and 0.18 for control and lower difficulty, tęs 
= 1.78, p = 0.04), supporting Hl. Participants reporting higher difficulty gave forecast 
increases in EPS that did not differ from those of control condition participants (adjusted 
means for year 1 = 0.16 and 0.15 for control and higher difficulty, t, = 0.35, p = 0.73; 
adjusted means for year 2 = 0.26 and 0.21 for control and higher difficulty, tes = 1.17, p 
— 0.25), supporting H2. 

Next, we test the unique predictions for Experiment 2—that participants who read both 
few (H3) and many (H4) counter-explanations will make less optimistic EPS forecasts than 
will control condition participants. Both hypotheses are partially supported for year 1 and 


19 Applying the bootstrapping technique as described for Experiment 1 improves results slightly in that the НІ 
contrast is significant for both years 1 and 2 (p « 0.05), while the H2 contrast remains insignificant (p > 0.10). 
Inferences for H3 and H4 are identical to those reported below. 
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TABLE 2 
Experiment 2: M.B.A. Students’ EPS Forecasts 


Panel A: Descriptive Statistics: Mean [standard deviation] 


Year 1 Year 2 
ANCOVA ANCOVA 
Adjusted Adjusted 
Initial Final Final Inftial Final Final 


Forecast Forecast Forecast Forecast Forecast Forecast 
Increase Increase Increase Increase Increase Increase 
Condition п in EPS in EPS in EPS* in EPS in EPS in EPS? 


Control 27 0.15 0.14 0.15 0.27 0.25 0.25 
[0.13] [0.16] [0.23] [0.26] 

Generate 20 0.23 0.13 0.09 0.34 0.20 0.16 
Few [0.47] [0.27] [0.52] [0.30] 

Generate 22 0.17 0.13 0.13 0.26 0.20 0.20 
Many [0.47] [0.32] [0.60] [0.43] 

Read Few 20 0.10 0.06 0.10 0.18 0.10 0.16 
[0.16] [0.13] [0.14] [0.15] 

Read 22 0.14 0.09 0.10 0.30 0.16 0.14 
Many [0.11] [0.15] [0.24] [0.23] 


Panel B: Planned Contrast Tests of Hypotheses 
Year 1 Year 1 Year 2 Year 2 


Hypothesis Contrast t-statistic p-value t-statistic p-value 

H1: Lower relative optimism for generate few versus 1.55 0.06 2.01 0.02 
control 

H2: No difference in relative optimism for generate 0.62 0.54 1.05 0.30 
many versus control 

H3: Lower relative optimism for read few versus 1.41 0.08 1.98 0.03 
control 

H4: Lower relative optimism for read many versus 1.51 0.07 2.37 0.01 
control 


* The final forecast increase in EPS is adjusted statistically for the initial forecast increase in EPS bv including 
this as a covariate in the ANCOVA model. The adjusted means are reported here. 
Panel A provides descriptive statistics by experimental condition. “Year г final forecast increase in EPS” is our 
dependent variable and is defined as the final (post-manipulation) forecast for year t (t = 1 or 2) less the year 0 
EPS of —$0.08 as provided to participants. In our analyses, we adjust for the influence of participants’ initial 
(pre-manipulation) forecasts using analysis of covariance (ANCOVA) and thus, we report the ANCOVA least 
squares adjusted means in Panel A. Participants made initial EPS forecasts after reading summarized financial 
information and made final EPS forecasts after reading a conference-call excerpt (which presented information 
about managers’ plans to improve future earnings in scenario structure) and generating counter-explanations (if 
applicable). Participants in the generate few (generate many) condition were asked to list 2 (10) reasons why 
managers' plans could fail before making final forecasts. Participants in the read few (read many) condition were 
asked to read the reasons why managers' plans might fail that were produced by matched participants in the 
generate few (generate many) condition. Participants in the control condition were not asked to generate or read 
counter-explanations before making final forecasts. 
Panel B provides tests of hypotheses comparing the forecast increases in EPS of participants in experimental 
conditions to those of participants not asked to generate or read counter-explanations (і.е., participants in the 
control condition) to examine (1) whether the availability of counter-explanations (i.e., the ease or difficulty 
experienced during the counter-explanation task) affects the relative amount of scenario-induced optimism (НІ 
and H2) and (2) whether the observed effect is driven by availability and not by variations in counter- 
explanation content between experimental conditions (H3 and H4). If the forecast increases in EPS in 
experimental conditions are less than the forecast increases in EPS in the coatrol condition, then experimental 
condition participants are relatively less optimistic. P-values are one-tailed far directional predictions (H1, H3, 
and H4) and two-tailed for H2. 
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supported for year 2. Participants asked to read few counter-explanations gave marginally 
less optimistic forecast increases in EPS than did control condition participants for year 1 
(tios = 1.41, p = 0.08) and gave significantly less optimistic forecast increases in EPS than 
did contro] condition participants for year 2 (tios = 1.98, р = 0.03). Likewise, participants 
asked to read many counter-explanations gave marginally less optimistic forecast increases 
in EPS than did control condition participants for year 1 (tios = 1.51, р = 0.07) and gave 
significantly less optimistic forecast increases in EPS than did control condition participants 
for year 2 (tios = 2.37, р = 0.01). Taken together, results support that the availability of 
counter-explanations is an important determinant of whether counter-explanation reduces 
scenario-induced forecast optimism. 


V. CONCLUSION 

In this paper, we examine whether counter-explanation can be used to reduce scenario- 
induced optimism in analysts’ forecasts. In two experiments, we find that the EPS forecasts 
of professional analysts (Experiment 1) and M.B.A. students (Experiment 2) are influenced 
by the availability of (1.е., the ease of generating) counter-explanations. Consistent with 
availability, participants in conditions in which generating counter-explanations was rela- 
tively easy made lower EPS forecasts than did control condition participants. Participants 
in conditions in which generating counter-explanations was relatively difficult made EPS 
forecasts that did not differ from the forecasts made by control condition participants. In 
Experiment 1, we perform supplemental tests to rule out alternative explanations and thus, 
provide evidence suggesting that differences in the ability of counter-explanation tasks 
to reduce forecast optimism are caused by differences in the availability of counter- 
explanations. 

In Experiment 2, we provide additional evidence that results are consistent with avail- 
ability influencing counter-explanation’s effectiveness. We provided some participants with 
the counter-explanations generated by other participants (in identical form and order) in- 
stead of having these participants generate counter-explanations themselves. Because read- 
ing many counter-explanations is at least as effective in reducing optimism as is reading 
few counter-explanations, we infer that it is the availability of (i.e., the ease of accessing) 
counter-explanations that drives the reduction in relative optimism observed in Experiments 
1 and 2. 

This research is the first that we know of to investigate counter-explanation as a means 
of reducing cognitive biases associated with predictions made by experts. Because managers 
are likely to continue using scenarios to describe their plans to improve future firm per- 
formance, analysts and others who rely on this information when developing forecasts, but 
wish to avoid optimism, could use a simple counter-explanation task to avoid optimistic 
predictions. | 

Our results also suggest that analysts do not appear to consider spontaneously a small 
number of counter-explanations when initially contemplating managers’ scenarios. To the 
extent that scenario-induced optimism is manifest in analysts’ forecasts of annual earnings, 
analysts’ failure to consider counter-explanations of their own accord may be one factor 
sustaining optimism. 

Finally, we demonstrate a boundary condition for the use of counter-explanation as a 
debiasing mechanism. In both experiments, we find that the availability of counter- 
explanations (1.е., the ease or difficulty of their generation) influences the ability of this 
mechanism to reduce scenario-induced optimism in a prediction task. Results suggest that 
when implementing counter-explanation as a debiasing mechanism, it is important to be 
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aware that a lack of available counter-explanations can dampen the mechanism’s effective- 
ness. A limitation of the study is that using a hypothetical firm precludes us from calculating 
an absolute measure of forecast optimism and thus, we are unable to determine whether 
counter-explanation completely eliminates forecast optimism. 

Optimism in analysts’ long-term forecasts likely is costly to both investors and securities 
firms. Prior empirical research suggests that correcting for analysts’ forecast optimism can 
improve value estimates (Frankel and Lee 1998). Recently, a number of Wall Street firms 
agreed to a $1.4 billion settlement for charges by the SEC, NASD, and NYSE, among 
others, that their analysts routinely issued overly optimistic stock research to investors in 
order to curry favor with corporate clients and to win investment banking business (Smith 
et al. 2003). Although these charges presumably relate to forecast optimism driven by 
economic incentives, addressing more subtle sources of forecast optimism is likely to be- 
come of increasing concern to regulators, investors, and Wall Street analysts. Our study 
introduces a theory-based, yet practical way of reducing scenario-induced optimism in an- 
alysts’ annual earnings forecasts. 


REFERENCES 

Abarbanell, J. S., and У. L. Bernard. 1992. Tests of analysts’ overreaction/underreaction to earnings 
information as an explanation for anomalous stock price behavior. The Journal of Finance 67 
(3): 1181-1207. 

— and В. Lehavy. 2003. Can stock recommendations predict earnings management and analysts’ 
earnings forecast errors? Journal of Accounting Research 41 (1): 1-31. 

Affleck-Graves, J., L. R. Davis, and R. R. Mendenhall. 1990. Forecasts of earnings per share: Possible 
sources of analyst superiority and bias. Contemporary Accounting Research 6 (2): 501—517. 

Ali, A., A. Klein, and J. Rosenfeld. 1992. Analysts' use of information about permanent and transitory 
earnings components in forecasting annual EPS. The Accounting Review 67 (1): 183-198. 

, L. Hwang, and M. A. Trombley. 2003. Residual-income-based valuation predicts future stock 
returns: Evidence on mispricing vs. risk explanations. The Accounting Review 78 (2): 377-396. 

Arkes, H. R. 1991. Costs and benefits of judgment errors: Implications for debiasing. Psychological 
Bulletin 110 (3): 486—498. 

Bonner, S., R. Libby, and M. W. Nelson. 1996. Using decision aids to improve auditors' conditional 
probability judgments. The Accounting Review 71 (2): 221—240. 

Bradshaw, M. T., S. A. Richardson, and R. G. Sloan. 2003. Pump and dump: An empirical analysis 
of the relation between corporate financing activities and sell-side analyst research. Working 
paper, Harvard University, University of Pennsylvania, and University of Michigan. 

. 2004. How do analysts use their earnings forecasts in generating stock recommendations? 
The Accounting Review 79 (1): 25—50. 

Brown, L. D. 1997. Analyst forecasting errors: Additional evidence. Financial Analysts Journal 
(November/December): 81—88. 

. 2001. A temporal analysis of earnings surprises: Profits versus losses. Journal of Accounting 
Research 39 (2): 221-241. 

Darrough, M. N., and T. Russell. 2002. A positive model of earnings forecasts: Top down versus 
bottom up. Journal of Business 75 (1): 127-152. 

DeBondt, W. F. M., and R. H. Thaler. 1990. Stock market volatility: Do security analysts overreact? 
American Economic Review 80 (2): 52—57. 

Dechow, P. M., A. P. Hutton, and R. G. Sloan. 1999. An empirical assessment of the residual income 
valuation model. Journal of Accounting and Economics 26: 1—34. 

| , and . 2000. The relation between analysts’ forecasts of long-term earnings 

growth and stock price performance following equity offerings. Contemporary Accounting Re- 

search 17 (1-3): 1-32. 




















The Accounting Review, March 2006 


Using Counter-Explanation to Limit Analysts’ Forecast Optimism 395 


Dowen, R. J. 1996. Analyst reaction to negative earnings for large well-known firms. The Journal of 
Portfolio Management 23 (1): 49—55. 

Dugar, A., and S. Nathan. 1995. The effect of investment banking relationships on financial analysts’ 
earnings forecasts and investment recommendations. Contemporary Accounting Research 12 
(1): 131-160. 

Duru, A., and D. M. Reeb. 2002. International diversification and analysts’ forecast accuracy and bias. 
The Accounting Review 77 (2): 415—433. 

Eames, M., S. M. Glover, and J. Kennedy. 2002. The association between trading recommendations 
and broker-analysts’ earnings forecasts. Journal of Accounting Research 40 (1): 85—104. 
———, and ——. 2003. Earnings predictability and the direction of analysts’ earnings forecast 

errors. The Accounting Review 78 (3): 707—724. 

Elgers, Р. Т., and M. Н. Lo. 1994. Reductions in analysts’ annual earnings forecast errors using 
information in prior earnings and security returns. Journal of Accounting Research 32 (2): 290— 
303. 

Francis, J., and D. Philbrick. 1993. Analysts’ decisions as products of a multi-task environment. 
Journal of Accounting Research 31 (2): 216—230. 

Frankel, R., and C. M. C. Lee. 1998. Accounting valuation, market expectation, and cross-sectional 
stock returns. Journal of Accounting and Economics 25 (3): 283-319. 

Graham, 7. R., С. К. Harvey, and S. Rajgopal. 2005. The economic implications of corporate financial 
reporting. Journal of Accounting and Economics 40 (1-3): 3-73. 

Heiman, V. B. 1990. Auditors’ assessments of the likelihood of error explanations in analytical review. 
The Accounting Review 65 (4): 875—890. 

Hoch, S. J. 1984. Availability and interference in predictive judgment. Journal of Experimental Psy- 
chology: Learning, Memory, and Cognition 10 (4): 649-662. 

Hong, H., and J. D. Kubik. 2003. Analyzing the analysts: Career concerns and biased earnings fore- 
casts. The Journal of Finance 58 (1): 313-351. 

Hunton, J. E., and R. A. McEwen. 1997. An assessment of the relation between analysts’ earnings 
forecast accuracy, motivational incentives and cognitive information search strategy. The Ac- 
counting Review 72 (4): 497-515. 

Irvine, Р. 2004. Analysts’ forecasts and brokerage-firm trading. The Accounting Review 79 (1): 125— 
149. 

Jacob, J., S. Rock, and D. P. Weber. 2003. Do analysts at independent research firms make better 
earnings forecasts? Working paper, University of Colorado at Boulder. 

Kahneman, D., and A. Tversky. 1982. The simulation heuristic. In Judgment Under Uncertainty: 
Heuristics and Biases, edited by D. Kahneman, P. Slovic, and A. Tversky. Cambridge, U.K.: 
Cambridge University Press. 

—  —, and D. Lovallo. 1993. Timid choices and bold forecasts: A cognitive perspective on risk 
taking. Management Science 39 (1): 17-31. 

Kennedy, J. 1993. Debiasing audit judgment with accountability: A framework and experimental 
results. Journal. of Accounting Research 31 (2): 231-245. 

. 1995. Debiasing the curse of knowledge in audit judgment. The Accounting Review 70 (2): 
249-273. 

Kim, C., and S. Lustgarten. 1998. Broker-analysts’ trade-boosting incentive and their earnings forecast 
bias. Working paper, Queens College of The City University of New York and Baruch College 
of The City University of New York. 

Klein, A. 1990. A direct test of the cognitive bias theory of share price reversals. Journal of Accounting 
and Economics 13 (2): 155-166. 

Koonce, L. 1992. Explanation and counter-explanation during audit analytical review. The Accounting 
Review 67 (1): 59-76. 

Krishnan, M., and K. Sivaramakrishnan. 1998. A theory of analysts’ forecast bias. Working paper, 
Rutgers University and Texas A&M University. 

Lee, С. M. C., J. Myers, and B. Swaminathan. 1999. What is the intrinsic value of the Dow? The 
Journal of Finance 54 (5): 1693-1741. 








The Accounting Review, March 2006 


396 Kadous, Krische, and Sedor 


Libby, R., R. Bloomfield, and M. W. Nelson. 2002. Experimental research in financial accounting. 
Accounting, Organizations and Society 27 (8): 775—810. 

Lim, T. 2001. Rationality and analysts' forecast bias. The Journal of Finance 56 (1): 369—385. 

Lin, H., and M. F. McNichols. 1998. Underwriting relationships, analysts’ earnings forecasts and 
investment recommendations. Journal of Áccounting and Economics 25 (1): 101—127. 

Liu, J., and J. Thomas. 2000. Stock returns and accounting earnings. Journal of Accounting Research 
38 (1): 71-101. 

Loh, R., and С. М. Mian. 2005. Do accurate earnings forecasts facilitate superior investment rec- 
ommendations? Journal of Financial Economics (forthcoming). 

Matsumoto, D. A. 2002. Management’s incentives to avoid negative earnings surprises. The Account- 
ing Review 77 (3): 483—514. 

McNichols, M., and P. C. O'Brien. 1997. Self-selection and analyst coverage. Journal of Accounting 
Research 35 (Supplement): 167—199. 

Michaely, R., and K. L. Womack. 1999. Conflict of interest and the credibility of underwriter analyst 
recommendations. Review of Financial Studies 12 (4): 653—686. 

Mikhail, M. B., B. R. Walther, and R. H. Willis. 1999. Does forecast accuracy matter to security 
analysts? The Accounting Review 74 (2): 185-200. 

National Investor Relations Institute (NIRI). 2003. NIRI issues survey results on earnings guidance 
practices; First Call eliminates the term “consensus.” Executive Alert: Updates on Issues Vital | 
to the Practice of Investor Relations. Available at: http:/ / www.niri.org/irresource.. pubs/alerts 
/ea20031210.pdf. 

Nelson’s Directory of Investment Research. 2003. Port Chester, NY: W.R. Nelson & Co. 

Ohlson, J. A., and B. E. Juettner-Nauroth. 2005. Expected EPS and EPS growth as determinants of 
value. Review of Accounting Studies 10 (2-3): 349—365. 

Peecher, M. E., and I. Solomon. 2001. Theory and experimentation in studies of audit judgments and 
decisions: Avoiding common research traps. International Journal of Auditing 5 (3): 193—203. 

Richardson, S., S. Н. Teoh, and P. Wysocki. 2004. The walk-down to beatable analyst forecasts: The 
role of equity issuance and insider trading incentives. Contemporary Accounting Research 21 
(4): 885—924. 

Sanna, L. Г., М. Schwarz, and S. L. Stocker. 2002. When debiasing backfires: Accessible content and 

| accessibility experiences in debiasing hindsight. Journal of Experimental Psychology: Learning, 
Memory and Cognition 28 (3): 497—502. 

Schwarz, N., E. Strack, H. Bless, G. Klumpp, H. Rittenauer-Schatka, and A. Simons. 1991. Ease of 
retrieval as information: Another look at the availability heuristic. Journal of Personality and 
Social Psychology 61 (2): 195-202. 

. 1998. Accessible content and accessibility experiences: The interplay of declarative and 
experiential information in judgment. Personality and Social Psychology Review 2 12): 87—99. 

Sedor, L. 2002. An explanation for unintentional optimism in analysts' earnings forecasts. The Ac- 
counting Review 77 (4): 731—753. 

Smith, R., S. Craig, and D. Solomon. 2003. Wall Street firms settle charges over reseacch in $1.4 
billion pact. Wall Street Journal (April 29): A1. 

Sougiannis, T., and T. Yaekura. 2001. The accuracy and bias of equity values inferred from analysts' 
earnings forecasts. Journal of Accounting, Auditing & Finance 16 (4): 331-362. 

Stokes, M. E., C. S. Davis, and G. G. Koch. 1995. Categorical Data Analysis Using the SAS System. 
Cary, NC: SAS Institute Inc. 

Tversky, A., and D. Kahneman. 1973. Availability: A heuristic for judging frequency and »robability. 
Cognitive Psychology 5 (2): 207—232. 

Wall Street Journal Online. 2002. Coca-Cola to end practice of giving earnings forecasts. A Wall 
Street Journal Online News Roundup (December 13). 

Wanke, M., H. Bless, and B. Biller. 1996. Subjective experience versus content of information in the 
construction of attitude judgments. Personality and Social Psychology Bulletin 22 (11): 1105— 
1113. 





The Accounting Review, March 2006 


Using Counter-Explanatior. to Limit Analysts’ Forecast Optimism 397 


— С. Bohner, and А. Jurkowitsch. 1997. There are many reasons to drive a BMW: Does 
imagined ease of argument generation influence attitudes? Journal of Consumer Research 24 
(2): 170-177. 

Zuckerman, G. 2005. CEOs turn mum about projecting earnings. Wall Street Journal (March 1): C1, 
C4. 


ue DE 


2 a Mon ` 
The Accounting Review, March 2006 `, 
. 50 i 


THE ACCOUNTING REVIEW 
Vol. 81, No. 2 

2006 

pp. 399-420 


Expense Misreporting 
in Nonprofit Organizations 


Ranjani Krishnan 
Michigan State University 


Michelle H. Yetman 
University of California, Davis 


Robert J. Yetman 
University of California, Davis 


ABSTRACT: We examine whether nonprofit organizations understate fundraising ex- 
penses in their publicly available financial statements. A large body of anecdotal evi- 

` dence notes that an inexplicable number of nonprofits report zero fundraising ex- 
penses. We provide empirical evidence that the zero fundraising expense phenomenon 
is at least partly due to inappropriate reporting. We then examine to what extent these 
misreported expenses are the result of managerial Incentives. Prior research finds ап 
association between reported expenses and managerial compensation as well as the 
level of donations received. Using these findings we construct two Incentive variables 
and find a positive assoclation between misreporting behavior and managerial incen- 
tives. Our results also suggest that the use of an outside accountant reduces the prob- 
ability that a nonprofit will misreport expenses, consistent with the use of an outside 
paid accountant increasing the reliability and usefulness of nonprofit financial reports. 
Finally, we find that SOP 98-2 reduced the probability that a nonprofit will misreport 
fundraising expenses. 
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I. INTRODUCTION 


e examine whether nonprofit managers incorrectly classify costs across the cat- 
egories of functional expenses reported on their IRS Form 990 (hereafter, IRS 


990), and whether this incorrect classification is driven by managerial incentives.’ 





! The IRS 990 is the primary source of publicly available nonprofit financial information (Joint Committee on 
Taxation 2000). 


We thank two anonymous reviewers, Ashiq Ali, Leslie Eldenburg, Teresa Gordon, Angela Gore, Janet Greenlee, 

Chris Jones, Lil Mills, Mike Shields, workshop participants at the 2003 American Accounting Association meetings, 

The University of Arizona, The George Washington University, Michigan State University, and Arizona State 

University for their valuable comments and advice. 

Editor’s note: This paper was accepted by Terry Shevlin, Senior Editor. : 
Submitted June 2005. 

Accepted September 2005 


399 


400 Krishnan, Yetman, and Yetman 


The IRS 990 partitions expenses into programs (i.e., charitable), fundraising, and admin- 
istrative categories. Prior research provides evidence that two metrics based on these three 
expense classifications are of particular importance because they influence managerial com- 
pensation as well as the decisions of donors and other stakeholders (Baber et al. 2002; 
Okten and Weisbrod 2000; Posnett and Sandler 1989; Weisbrod and Dominguez 1986). 
The first metric is the program ratio, which is the amount of program expenses divided by 
total expenses. Higher program ratios indicate that a larger proportion of total expenses are 
directed toward the charitable mission. Various charity watchdog agencies as well as state 
regulators suggest that higher program ratios are indicative of more efficient and effective 
organizations. The second metric is the fundraising ratio, which is the amount of fundraising 
expenses divided by total donations. Lower fundraising ratios are suggestive of higher 
fundraising efficiency because fewer donations are consumed by the cost of fundraising 
activities. 

Concerns over the quality of nonprofit financial reporting are increasing (Yetman and 
Yetman 2004). Congress recently commissioned the Government Accountability Office 
(GAO) to conduct a study of nonprofit reporting (United States GAO 2002). The GAO 
found several reporting weaknesses, including finding that over one-half of all nonprofits 
that earn donations fail to report any fundraising expenses. Although the GAO report and 
many subsequent informal analyses (e.g., Froelich et al. 2000; Hager 2003) conclude that 
this finding is curious and inexplicable, scant empirical evidence exists that this phenom- 
enon is the result of incorrect reporting. 

We conduct two separate empirical tests to determine whether nonprofits inappropri- 
ately understate their fundraising expenses and overstate their program expenses. For the 
first test, we randomly select a sample of nonprofits that receive donations and report zero 
fundraising expenses on their IRS 990, and search their Internet web pages far evidence 
that they conduct fundraising activities. We find that nearly half of the websites we ex- 
amined show evidence of fundraising activities of the type that would give rise to reportable 
fundraising expenses that ostensibly should have been reported on the IRS 960. For the 
second test we compare a sample of audited financial statements to matched sets of IRS 
990s. We find that 38 percent of the observations that reported some fundraising expenses 
on their audited financial statements reported lower (and in many cases zero) mounts of 
fundraising on their IRS 990s, and that the mean amount of fundraising reported on the 
990 is significantly less than that reported on the financial statements. 

We next examine the firm-specific characteristics that are associated with these misal- 
located expenses. The primary goal of this analysis is to determine whether nonprofit ex- 
pense misreporting is random error, related to managerial incentives, or the result of other 
factors such as lack of expertise. We examine two sources of incentives for managers to 
misallocate expenses: to garner higher managerial compensation and to attract additional 
donations revenues. We compute firm-level incentive variables using time-series associa- 
tions between the program ratio and managers’ compensation and donations revenue. We 
find our incentive measures to be positively associated with the incidence of zero fundrais- 
ing expenses. We also find that the use of an outside accountant and accounting rule changes 
reduces the incidence of zero fundraising expenses. Our results are robust to controlling 
for expertise (as measured by size and age of the charity). 

As previously noted, a growing body of anecdotal evidence points out that an inexpli- 
cable number of nonprofits report zero fundraising expenses. Our paper makes a contri- 
bution by being among the first to provide empirical evidence that this phenomenon is due 
to incorrect reporting. A recent paper by Jones and Roberts (2006) examines if nonprofits 
misallocate the costs of combined fundraising and educational campaigns (so called “‘joint 
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costs"). They find that nonprofits over-allocate expenses to education and under-allocate 
expenses to fundraising, thus simultaneously improving their program and fundraising ra- 
tios. Although Jones and Roberts (2006) examine a different setting (ie., the subset of 
expenses classified as “‘joint costs”) using a different sample (i.e., only those observations 
that report “joint costs," which is about 2 percent of the nonprofit population), their results 
are consistent with ours and provide additional evidence that nonprofit organizations mis- 
report their financial reports. 

In addition to documenting expense misallocations by nonprofits, our paper also pro- 
vides evidence that nonprofit managers respond to incentives to manage their publicly 
available financial information by misstating the amounts of expenses, which is important 
in that it addresses the issue of whether these misallocations are due to random error, 
intentional manipulation, or some other factor. 

Our results have important implications for nonprofit researchers as they suggest that 
organizations that report zero fundraising expenses should be carefully scrutinized. De- 
pending on the type of analysis and question being asked, it might be prudent for a re- 
searcher to exclude organizations that report zero fundraising expenses from their analysis. 
In addition, our study has important policy implications. From a public policy perspective, 
to the extent donors use the disclosed figures for making their donations allocation deci- 
sions, resources are misallocated among nonprofits. Also, our study provides potentially 
useful information to governmental regulators responsible for nonprofit financial informa- 
tion accuracy, especially in light of the recent attention given to nonprofits' reported fund- 
raising activities by the courts (e.g., in May, 2003, the United States Supreme Court ruled 
that states can pursue fraud charges against fundraisers who intentionally mislead donors; 
see 538 U.S., 2003 Madigan v. Telemarketing Associates) and by the previously mentioned 
GAO report. 

The next section discusses the agency problem in nonprofit organizations and its influ- 
ence on nonprofit financial reporting. Section IJI provides evidence that nonprofits under- 
state their fundraising expenses and thereby likely overstate their program expenses. In 
Section IV we examine the characteristics that are associated with the incidence of zero 
reported fundraising expenses and attempt to determine if observed behavior is consistent 
with managerial incentives. The final section concludes and discusses the implications of 
our findings. 


П. AGENCY PROBLEMS IN NONPROFIT ORGANIZATIONS 

Because contracts usually cannot be costlessly written and enforced, most organiza- 
tions, regardless of their ownership structure, face agency problems (Fama and Jensen 1983; 
Jensen and Meckling 1976). In publicly traded firms, the agency problem arises because 
operating decisions are made by managers whose interests are not necessarily identical to 
those of the shareholders. Economists suggest that in some types of services, the nonprofit 
organizational form may help reduce these agency conflicts (Arrow 1963; Fama and Jensen 
1983; Hansmann 1996). This literature argues that nonprofit organizations arise in areas 
such as healthcare, in response to the asymmetry of information between the consumers 
and the producers of a service. To instill trust in the recipient, the providing firm uses 
the nonprofit form of organization to provide a signal that the firm is not interested in 
profit maximization, but rather is working to maximize the recipient's welfare. Using similar 
logic, Fama and Jensen (1983) conclude that, “the absence of residual claims avoids the 
donor-residual claimant agency problem and explains the dominance of nonprofits in donor- 
financed activities." 


The Accounting Review, March 2006 


402 Krishnan, Yetman, and Yetman 


However, the absence of residual claimants does not automatically ensure that nonprofit 
organizations will be free of agency problems. Managers can consume more resources than 
is required for efficient production for various reasons. For example, managers can consume 
excessive perquisites, exert lower effort, or expend resources on activities peripheral to the 
stated mission of the organization, leading to agency losses. If managers are not effectively 
monitored by donors, then the potential for agency losses increases (Hansmann 1996). 
Under these circumstances, accounting information can assist donors and other stakeholders 
in monitoring their implicit contracts by providing a means for donors to evaluate whether 
the nonprofit is using its donations in the most efficient and effective manner. 

Indeed a substantial body of research shows that donors use accounting measures to 
monitor the efficiency of a nonprofit organization to ensure stewardship of their donated 
resources and to assist them in making their donation allocation decisions (Baber et al. 
2001; Callen 1994; Okten and Weisbrod 2000; Posnett and Sandler 1989; Tinkelman 1999; 
Weisbrod and Dominguez 1986; Yetman and Yetman 2003). This literature concludes that 
donors consider the program and fundraising ratios as important measures of nonprofit 
organizational efficiency and empirically documents that donations increase as program 
ratios increase (Harvey and McCrohan 1988; Khumawala and Gordon 1997; Weisbrod and 
Dominguez 1986) and decrease as fundraising ratios increase (Okten and Weisbrod 2000). 

Because program ratios are an important factor for donors, compensating a manager 
based on program service ratios is one way to increase goal congruence between the donors 
and the managers such that when the managers maximize their utility, donors’ utility is 
also maximized, thereby reducing agency losses. Consistent with this logic, prior research 
has documented a positive relationship between program ratios and compensation (Baber 
et al. 2002). 

When accounting information is used as a basis for managerial compensation and 
donation decisions, one implication that follows is that managers would prefer to disclose 
accounting information that makes it appear as though the nonprofit is functioning effi- 
ciently. A number of prior accounting studies in the for-profit setting including Healy 
(1985), Holthausen et al. (1995), and Guidry et al. (1999) find that managers who have the 
economic incentive (such as bonus tied to financial performance) manipulate earnings to 
increase their compensation. In the following empirical analysis we first determine if non- 
profit managers misreport expense allocations in their financial reports by shifting costs 
away from fundraising and toward programs. We then test our agency predictions by ex- 
amining the association between the strength of managers’ incentives and financial state- 
ment management. 


Ш. EXPENSE MISREPORTING ON THE IRS FORM 990 
As previously noted, a large number of nonprofits report zero fundraising expenses. 
Anecdotal literature speculates that this behavior is inappropriate and perhaps even inten- 
tional, as a nonprofit can improve its program ratio by shifting costs from fundraising 
expenses to program expenses. The purpose of this portion of our analysis is to determine 
whether the widespread anecdotal speculation can be supported by empirical evidence that 
nonprofits are inappropriately presenting their financial results to the public. 
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We address the issue by investigating expense allocations on IRS 990, which is the 
principle source of nonprofit financial information. All organizations granted federal tax- 
exemption with gross receipts of more than $25,000 must file a 990.? Penalties, including 
imprisonment and loss of tax-exempt status, apply to fraudulently filed IRS 990s, although 
audit rates are low. Affiliated or related organizations are also required to file a complete 
990 unless they choose to be included in a combined return. 

IRS 990 data are available from two sources, and we use both of these databases for 
various portions of our analyses. The first is the National Center for Charitable Statistics 
(NCCS) database, which includes computer readable data for the years 1982 through 2001 
(exclusive of 1984 when no sample was taken). The NCCS database includes all nonprofits 
with total assets greater than $25 million, plus a stratified random sample of smaller non- 
profits. Thus, the NCCS database does not include all nonprofits in every year, but it does 
capture the bulk of the economic activity of the nonprofit sector. The second nonprofit 
database is from Guidestar, a project of the Urban Institute sponsored by the Internal 
Revenue Service. Guidestar provides scanned images of IRS form 990s on their web page 
(http: // www.guidestar.org) and is the most comprehensive database of nonprofit financial 
information available, but unlike the NCCS data discussed above, the Guidestar data must 
be hand collected. 

We restrict our samples to IRC 501(c) (3) organizations but exclude private founda- 
tions? We compare reported amounts of fundraising and program expenses on the IRS 990 
to two other distinct sources of data including Internet web pages and financial statements. 
We presume that audited financial statements are less likely to be subject to manipulation 
and are therefore useful to calibrate the accuracy of the reported amounts on the IRS 990; 
we discuss this issue in more detail below. 


Comparison of the IRS 990 to Internet Web Pages 

For our first test we use the samples of educational, medical, and charitable nonprofit 
IRS 990 data as contained in the 1998 NCCS database. We use 1998 data as it was the 
most recent year of data available when this portion of our study was conducted. We limit 
our analysis to those nonprofits that receive public donations (from individuals and cor- 
porations) because the other two types of donations (i.e., government grants and feeder 
donations such as those from the United Way) can frequently be raised with little consistent 
fundraising effort. The intent of our analysis is to examine a sample of nonprofits tbat 
receive public donations and yet report zero fundraising expenses to determine if they 
should be reporting non-zero amounts of fundraising expenses. We present the results of 
this analysis in Table 1, which shows that in the 1998 sample there were 663 educational, 
2,090 medical, and 2,242 charitable nonprofits that reported zero fundraising expenses 


? Foreign nonprofits cannot solicit tax-deductible contributions in the United States, nor are they generally granted 

tax exemption on their activities within the U.S. All nonprofits granted federal and state tax exemption must be 

incorporated in a U.S. state (or Puerto Rico or the District of Columbia) and thus would be subject to form 990 

irements. 

Many 501(c)(3) public charities use the word "foundation" in their title even though they are legally organized 

and operated as public charities. See Sansing and Yetman (2006) for a discussion of the differences between 

public charities and private foundations. 

* Feeder organizations undercake fundraising activities and then distribute donations received to downstream non- 
profits. Although downstream nonprofits have to incur at least some amount of fundraising to obtain feeder 
funding, it is possible that the amounts expended per dollar of donations are small. With respect to government 
grants, fundraising can be lumped into one year, while a multiyear grant could be received over several years 
with no additional fundraising effort. 
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although they received public donations. We randomly selected (using a random number 
generator) 10 percent of these zero reporters (i.e., 66 educational, 209 medical, and 224 
charitable nonprofits) for further examination. Panel А of Table 1 provides descriptive 
statistics, which shows that the Internet sample has average revenues of $69 million, public 
donations of $4 million, and total assets of $99 million in 1998. In addition, the average 
program ratio is 0.82 and the average age is 29 years. 

Next we searched for our sample organizations' Internet websites using the Yahoo! and 
Google search engines and were able to find Internet websites for 38 educational, 91 med- 
ical, and 125 charitable nonprofits. Two investigators (one of the authors and a graduate 
research assistant) conducted separate extensive detailed searches of each Internet website 
for evidence of fundraising activities that would produce reportable expenses on the IRS 
990s. For our purposes, we consider evidence of reportable fundraising activities to be 
direct solicitations or instructions on how a potential donor could contribute and direct 
evidence of at least one individual responsible for fundraising activities (such as an em- 
ployee with the title of “director of fundraising" or "4ігесіог of donations"). We classified 
observations as having evidence of fundraising activities only if both investigators found 
both types of evidence (1.е., solicitation and personnel) on the website. During our Internet 
website search we found evidence of fundraising activities that met our hurdle for 26 
educational, 13 medical, and 72 charitable organizations. 

We recognize that some nonprofits have related unconsolidated fundraising foundations 
that perform their fundraising. To address the effects of fundraising subsidiaries we searched 
the Guidestar nonprofit database for related entities for the sample of 26 educational, 13 
medical, and 72 charitable nonprofits for which we found evidence of fundraising. After 
removing observations that have related fundraising foundations, we find that 23 of 38 (60 
percent) of educational, 6 of 91 (7 percent) of medical, and 72 of 125 (58 percent) of 
charitable nonprofits are likely to have inaccurately reported their fundraising expenses on 
their IRS 990. In total we find that about 40 percent of nonprofits examined (101 out of 
254) that report zero fundraising expenses on their IRS 990 probably incur some reportable 
fundraising expenses. 

We also recognize that it is possible that the organizations used volunteer labor for 
their fundraising activities. For this reason our search was for not only a fundraising em- 
ployee, but also for evidence of solicitation activities. By their nature, solicitation activities 
are not donated (electricity, computer time, supplies costs, allocated building costs, etc.) 
and would thus produce reportable fundraising expenses on the IRS 990. However, to gain 
further assurance that the identified employees were not all voluntary, we examined the 
IRS 990s of all organizations in the sample for evidence that they used volunteer labor 
(line 82a on part VII of the IRS 990). We found that only three of our sample organizations 
reported receiving volunteer labor, and those amounts were relatively small. Thus, we feel 
confident that the use of volunteer labor is not the cause of zero reported fundraising 
expenses as identified by our Internet search. In summary, this portion of our analysis 
provides evidence that nonprofits understate the extent of their fundraising activities on 
their IRS 990s. 


5 Note that, on average, 37 percent of our sample observations do not report fundraising expenses, while арргох- 
imately 50 percent of the population does not report fundraising. Our sample has a lower percentage of zero 
reporters because we exclude organizations that earn their donations from feeder organizations or from the 
government, which as previously discussed can be earned with reduced fundraising efforts. 

$ We limit our analysis to the zero reporters because our intent in this section is to provide evidence that the zero 
fundraising reporters actualty incur some fundraising. 
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Comparison of the IRS 990 to Audited Financial Statements 


For the second test of understated fundraising expenses, we compare a sample of au- 
dited financial statements to matched sets of IRS 990s. We presume that the audited finan- 
cial statement is more accurate with respect to reported amounts of fundraising expenses 
as the financial statements are required to be prepared in accordance with Generally Ac- 
cepted Accounting Principles (GAAP) and are subject to independent verification by au- 
ditors. Nonprofits are not required by federal law to produce financial statements (other 
then their IRS 990) nor are they required to have audited any financial statements they do 
prepare. However, it is common for nonprofits to prepare financial statements and undergo 
audits for several reasons. First, many granting agencies such as the United Way and federal 
and state governments frequently require audits as a condition of funding. Seccnd, lenders 
frequently require audited financial statements as a condition of funding (over 60 percent 
of nonprofits have some form of debt outstanding). Third, many boards of direc:ors require 
audits for their nonprofits for financial oversight reasons. Finally, many states require non- 
profits with assets above certain threshold amounts to undergo financial statement audits if 
they solicit donations within that state. Because financial statements are generally required 
for some specific purpose or request, they typically are not available to the genzral public. 
Nonetheless, audited nonprofit financial statements must conform to GAAP and should 
properly report the amounts expended on programs, administration, and fundraising to the 
extent those amounts are material. 

To obtain a sample of audited financial statements, we sent a request to 1,824 organi- 
zations requesting copies of their audited financial statements. Of the 1,824 observations 
queried 216 (12 percent) voluntarily supplied us with copies of their audited financial 
statements. On average each nonprofit sent us two years of financials, providing us with a 
sample of 401 firm-year observations. We excluded observations that did not earn any public 
donations (1.е., from individuals and corporations) for reasons discussed earlier and obser- 
vations that reported zero fundraising expenses on their audited financial statements (but 
we did not exclude observations that reported zero fundraising expenses on their IRS 990)." 
We also took great care to ensure that the financial statements and the related IRS 990s 
were for the same entities and groups by matching on total revenues, total expenses, ad- 
dresses, and employer identification numbers. After these screens, 101 firm-year observa- 
tions representing 55 unique nonprofits remain.* 

Table 2, Panel A reports descriptive statistics, which shows average revenues of around 
$40 million, public donations close to $14 million, and total assets of $108 million. The 
average program ratio is 0.77 and the average age of the nonprofit is 42 years.? Table 2, 
Panel B shows that the mean reported fundraising on the financial statements is larger than 
that reported on the IRS 990 by about $2.7 million, which is statistically significant. Of 
the 101 observations in the sample, 52 organizations report nearly the same fundraising 


7 A nonprofit must report fundraising expenses, regardless of the magnitude, on its 990. Thus, if a nonprofit 
reports some fundraising on its financials but not on its 9908, then it is likely that the nonproft misreported 
fundraising expenses on its 990. However, due to materiality considerations that drive financial statement prep- 
aration (but not the 990s) it is not possible to draw any inferences when a nonprofit reports some fundraising 
expenses on its 990 but zero on its financial statements. 

* We replicate our analysis by year, such that each nonprofit is included only once in each yearly analysis. The 
results are substantively the same. 

9 In contrast, untabulated analysis based on the population of nonprofits available in the NCCS dacabase reveals 
that the response sample reports smaller program ratios than the population, which is consistent with nonprofits 
that are less aggressive misreporters being willing to voluntarily supply their otherwise confidential audited 
financial statements. If it is true that less aggressive nonprofits were more willing to supply us with their financial 
statements, then our estimates of understated fundraising expenses on the 990 are conservatively biased. 
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expenses on the IRS 990s versus the audited statements (i.e., within 5 percent of the amount 
reported on the financial statements), 11 report more fundraising on their IRS 990s, and 38 
report less fundraising on their IRS 990s.!? For the 11 that report more fundraising on their 
IRS 990s, the mean difference is $673,960, whereas for the 38 that report less fundraising 
on their 990, the mean difference is more than ten times greater ($7,277,797); the difference 
between the two groups is statistically significant. Because these unscaled numbers are 
likely subject to skewness, we also examine the program ratio and again find a statistically 
significant difference. For the 11 observations that report more fundraising on their 990, 
the mean effect on the program ratio is only 1 percent, whereas for the 38 that report less 
fundraising on the 990, the mean effect is 7 percent; the difference is statistically significant. 
Interestingly, we find that of the 38 observations that underreported their fundraising on 
their IRS 990s, only 14 of them reported zero fundraising. Thus, the occurrence of under- 
reported fundraising is pervasive to many nonprofits, including those who report non-zero 
fundraising in their IRS 990s. 

To summarize our results thus far, our study provides evidence that nonprofit organi- 
zations are understating their fundraising expenses, and as a result are likely overstating 
their program expenses. This has the effect of making nonprofits appear more efficient to 
stakeholders and regulators, potentially affecting resource allocation and regulating deci- 
sions. Our results provide among the first empirical evidence supporting the widely held 
belief that expenses reported on the IRS 990 are likely to be misstated. 


IV. ANALYSIS OF COST-SHIFTING INCENTIVES 

The purpose of this portion of our analysis is to determine if the incidence of reporting 
zero fundraising expenses is the result of random error, accounting rule changes, expertise, 
or managerial incentives consistent with the predictions of agency theory. If zero reporting 
is random, then we will fail to find any statistically significant association with managerial 
incentives or other factors. In contrast, if expense misreporting is associated with manager- 
ial incentives, then our results have important policy implications (1.е., for donations allo- 
cation and governance decisions by stakeholders) as well as implications for researchers 
who use nonprofit data as the basis of their empirical analyses. 

Measuring managerial incentives to manipulate financial information is a difficult task 
as it necessarily involves measuring an unobservable (latent) variable. Fortunately, prior 
literature identifies two clear incentives a nonprofit manager has to manipulate their infor- 
mation and provides us with ways to measure these incentives on a firm-specific basis. As 
discussed earlier, the first incentive arises from nonprofit managers' desire to maximize 
managerial pay, and the second incentive arises from nonprofit managers' desire to maxi- 
mize donations received. To estimate these incentives at the firm level, we rely on the 
findings of prior research that has shown a direct relationship between nonprofit managerial 
compensation and reported program ratios (Baber et al. 2002). The results of Baber et al. 
(2002) are consistent with nonprofit firms using compensation to align the incentives of 
managers and external stakeholders, and to reduce agency losses. Additionally, prior re- 
search finds that donors use similar ratios to evaluate the efficiency of nonprofits' operations 
while making their donation allocation decisions (Baber et al. 2001; Callen 1994; Okten 
and Weisbrod 2000; Posnett and Sandler 1989; Tinkelman 1999; Weisbrod and Dominguez 
1986; Yetman and Yetman 2003). 


10 Similarly, Froehlich et al. (2000) find that 50 percent (average of 45 percent in 1988 and 55 percent in 1994) 
of their sample reports the same amount of fundraising expenses on their audited financials and on their 990 
(matched to within 10 percent). 
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We next discuss the details of how we derive these two incentive metrics. We refer to 
the metrics as sensitivity variables, as they are intended to capture the sensitivity of com- 
pensation of the nonprofit manager to the reported program ratio, and the sensitivity of 
donations to the reported program ratio. 


Incentive Variables 
Compensation Sensitivity Variable 

Baber et al. (2002) conjecture and find evidence that changes in compensation are 
significantly associated with accounting-based performance measures, measured by changes 
in program spending resulting from both changes in revenue raised and changes in the 
program spending ratio. They separate these components in their model, where Comp is 
compensation paid to CEO, Revenue is total revenue, and AYield measures the extent to 
which the change in program spending is attributable to changes in the program spending 
ratio. 
Following Baber et al. (2002), we derive our organization-specific compensation sen- 
sitivity variable by estimating the following model by firm: 


%АСотр, = В, + B,?eARevenue, + B,%AProgram Ratio, + є, (1) 
where: 


%AComp, = Comp, KI Comp, | 


Comp,.., ( 
ARevenue,= Revenue, — Revenues. 
Revenue,_, 
Program, _ Program,_, Penis 
Revenue, Revenue, і 
%AProgram Ratio, = 3 


Program, 


Comp is officer’s compensation from line 25a of IRS 990, Revenue is total revenues from 
line 12 of IRS 990, and Program is program expenses on line 13 of IRS 990. With the 
exception of Comp, these variable definitions follow those in Baber et al. (2002). Note 
that our variable Program Ratio is identical to Yield in Baber et al. (2002). 

Using a cross-sectional analysis, Baber et al. (2002) find a statistically significant pos- 
itive B; in Equation (1), which they interpret as evidence that nonprofit officer’s compen- 
sation is increasing in the reported program ratio. We use a time-series analysis by firm to 
derive organization-specific estimates of B,, which measure the sensitivity of compensation 
to the reported program ratio,. after controlling for the effect of changes in revenue. The 
more sensitive a manager’s pay is to the reported program ratio, the more incentive a 
manager has to improve that ratio so as to garner higher compensation. 


П We measure Comp as the total compensation paid to officers, directors, and key employees, whereas Baber et 
al. (2002) use the salary of either the CEO or the highest paid employee. We do not apply the Baber et al 
(2002) procedure because about 30: percent of organizations do not report CEO compensation, or provide spe- 
cifics about the compensation paid to the five highest-paid employees, and our objective is to examine the entire 
population of nonprofits that file 990s. 
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Donations Sensitivity Variable 
We derive our organization-specific donations incentive variable by estimating the fol- 
lowing model by firm: 


%ADons, = Во + В,%АКеуепие, + B,%AProgram Кано, + £, (2) 
where: 


ФАропз, = Dons, — Dons, , 
Dons, , 

and Dons is public donations from line 1a of IRS 990. This model is a straightforward 
extension of Model 1, where we replace the change in compensation by change in donations 
as the dependent variable. Although Baber et al. (2002) do not investigate the effects of 
changes in the program ratio on changes in donations received, there is considerable support 
in the literature that donations are positively associated with the program ratio (see Okten 
and Weisbrod [2000] for a discussion of this literature).!? 

The organization-specific coefficients В, are estimates of how sensitive changes in do- 
nations received are to changes in reported program ratios, after controlling for the effects 
of changes in revenue. The more sensitive the amount of donations received is to the 
reported program ratio, the more incentive a manager has to improve that ratio so as to 
garner higher amounts of donations. 


Empirical Estimation of Incentive Variables 


We use data from the 1985 to 2001 NCCS database and for reasons discussed earlier 
require that each firm reports non-zero values for all variables and that they do not report 
any government grants or feeder donations. We require at least eight usable years of data 
(results are not sensitive to required time-series from six to 12 years). To control for outliers 
in Models (1) and (2) we winsorize all observations at the 1st and 99th percentile as well 
as remove observations with a studentized residual greater than 2 or less than —2. After 
these screens there are 6,826 valid observations for Model 1 and 26,374 valid observations 
for Model 2.13.14 


12 There are many alternative models that we could use to estimate the relationship between donations and the 
reported ratios other than this straightforward extension of Equation (1). To ensure that our results with respect 
to the donations sensitivity variable are not driven by the specific model above, we also estimate the sensitivity 
variable using the methodology of Greenlee and Brown (1999), who examine the effects of the administrative 
ratio (ie., the ratio of administrative expenses to total expenses or to total revenues) on donations with no 
changes in the inferences of our results. A separate line of literature examines the effects of reported fundraising 
expenses on donations (Okten and Weisbrod 2000; Weisbrod and Dominguez 1986). We are not able to use 
these models to estimate our sensitivity variables as they did not directly inzlude the program ratio as a regressor, 
but rather included the ratio of fundraising expenses to total expenses, which is zero for many observations in 
our sample. The presence of zero values reported by several nonprofits over time would produce unusable 
sensitivity measures because of our firm-specific regression methodology. 

13 We are unable to conduct this analysis using only the sample of firms that are analyzed in Section Ш (Table 
2), because of sample size limitations. After applying the constraints required for Models 1—3 (such as number 
of years of firm-level data availability) on our already small samples from Section Ш, we are left with too few 
observations to estimate our regression model. Hence, we use the broad set of nonprofits and use an indicator 
variable for zero reported fundraising expenses as a proxy for underreported fundraising expenses, because our 
analysis in Section Ш suggests that these organizations are misreporting their fundraising expenses. 

14 Recall that the data screen for Model 1 required positive values for the change in compensation variable, which 
was not the case for many observations, causing the number of usable observations for Model 1 to be smaller 
than those for Model 2. To ensure that our results are not sensitive to this, we replicated our results in Model 
2 limiting the analysis to the same observations used in Model 1 (if the data were not missing) with no change 
in the inferences of our results. 
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Table 3 presents the descriptive statistics for the regression variables used for estimating 
Models (1) and (2) as well as the results of those regressions. The mean and median officer's 
compensation (in total for the nonprofit) are $407 and $204 thousand, respectively. Baber 
et al. (2002) found a mean and median of $131 and $113 thousand, respectively. Our 
numbers are higher as we use the total amount of officer's compensation per nonprofit, 
which aggregates the pay of key officers, whereas Baber et al. (2002) report the average 
compensation paid to a single person. The mean and median percentage changes in officer's 
compensation are 7.66 and 6.97, respectively, which are very similar to those found by 
Baber et al. (2002) (i.e., mean of 6.84 and median of 5.65). The closeness of our compen- 
sation change statistics to those of Baber et al. (2002) suggests that the use of aggregate 
managerial pay is reasonable. The mean and median changes in the Program Ratio are 
—1.16 and 0.21, which are also reasonably close to the figures obtained by Baber et al. 
(2002) (i.e., mean of —2.18 and median of —0.40). The similarity of our statistics with 
those found in Baber et al. (2002) provide comfort that our sample composition, although 
different from Baber et al. (2002), is reasonable for the given analysis. 

In the lower portion of Table 3 we present the results of the firm-specific regressions. 
Approximately 25 percent of our observations have negative values for their firm-specific 
estimated sensitivity measures, which we recognize lack economic meaning and are an 
artifact of the estimation process due to small numbers of observations at the firm level. 
Because these negative sensitivities make no economic sense, we reset all negative values 
to be equal to zero because in theory these sensitivity variables are bounded below at zero. 
We use these truncated sensitivity measures for our primary tests, although we also under- 
took two alternative approaches to ensure that our results were not driven by this truncation. 
First, we use the sensitivity variables as produced and, second, we exclude observations 
with a negative sensitivitv. Although the magnitudes of the results vary with these additional 
tests, the general tenor of the results remains unchanged with respect to statistical 
inferences. 

For the compensation sensitivity regression we find a mean (median) organization- 
specific coefficient for the change in the program ratio of 0.414 (0.099). Baber et al. (2002) 
obtained a pooled-cross sectional coefficient estimate of 0.076 for their full sample, which 
is close to our median estimate. With respect to donations sensitivity, we find mean (median) 
values of 0.866 (0.076). We hypothesize that the benefits (i.e., payoff) to managing the 
reported amounts of program and fundraising expenses is increasing in these two sensitivity 
measures, and therefore ratio management (measured as reporting zero fundraising ex- 
penses) will be increasing in these sensitivity metrics. In addition to these two sensitivity 
variables, we include other independent variables, which we discuss in the next section. 


The Association between Managerial Incentives and Reporting Zero Fundraising 

The compensation and donation sensitivity measures are intended to capture firm- 
specific incentives to improve the fundraising and/or program ratios. As previously noted, 
one way to improve the ratios is to under-report the amount of fundraising expenses and 
over-report the amount of program expenses. We use under-reported fundraising expenses 
as our dependent variable to examine whether managers whose compensation and donations 
are relatively more sensitive to the reported program ratio are more likely to improve their 
financials by understating their fundraising expenses. 


Other Independent Variables 
We include two measures reflecting changes in accounting rules. The first is an indictor 
variable equal to 1 if the year is greater than or equal to 1995, which represents the effective 
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year of SFAS Nos. 116 and 117, both of which changed the format of nonprofit financial 
statements from a fund accounting orientation to a commercial accounting orientation. 
SFAS No. 117 had several provisions, including requiring specific financial statements and 
specific forms of those statements, which could affect the reported amounts of expenses. 
SFAS No. 117 requires that material amounts of expenses (program, administrative, and 
fundraising) be stated separately in the financial statements or notes, a requirement that 
could decrease the probability that a nonprofit would misreport its ехрепзез.15 The primary 
purpose of SFAS No. 116 was to clarify the definition of a “contribution” (versus other 
types of revenues resulting from “exchange or agency" transactions such as membership 
dues or sponsorships) and thus, because of matching, also necessarily addresses if and 
when to record the related “fundraising expense." Although SFAS Nos. 116 and 117 did 
not have a direct effect on how the IRS 990 is prepared, to the extent that audited financial 
statements are used as a basis for preparing the IRS 990, the passage of the new accounting 
rules could have had an effect on IRS 990 reporting. Indeed, Jones and Roberts (2006) find 
that most nonprofits that undergo an audit on their financial statements prepare their IRS 
990 in accordance with SFAS Nos. 116 and 117 when those rules do not disagree with 
IRS 990 preparation rules. 

We include a second indicator variable equal to 1 if the year is greater than or equal 
to 1998, the effective date of the American Institute of Certified Public Accountant's State- 
ment of Position 98-2 (SOP 98-2), which changed the way nonprofits report the expenses 
associated with combined fundraising and educational activities. Prior to SOP 98-2, many 
nonprofits classified all costs related to these joint campaigns as programs, which had the 
effect of inflating the program ratio and deflating the fundraising ratio (Jones and Roberts 
2006). SOP 98-2 requires nonprofits to partition these costs between programs and fund- 
raising. Roberts (2005) and Jones and Roberts (2006) examine the effects of SOP 98-2 on 
nonprofit financial reporting and in general find that it had a positive effect on nonprofit 

` financial reporting quality. 

In addition to incentives and accounting rule changes, inherent expertise could play a 
role in the accuracy of a nonprofit's financial information. Although it is difficult to directly 
measure the construct of expertise, we include the use of an outside accountant (as reported 
on line 31a of the IRS 990) as a reasonable proxy. We recognize that an alternative 
interpretation of this variable is that the use of an outside accounting firm could indicate 
that the nonprofit received a financial statement audit, and the resulting audited financial 
statements could provide the basis for a more complete and correct IRS 990. We therefore 
interpret this variable with caution. As discussed later, we also partition our models across 
size and age to calibrate the effects of potential expertise that is associated with larger or 
older nonprofits, as age and size could be measures of inherent expertise or general level 
of care. 

We also consider the role of financial need in the reporting decision. In theory, a 
nonprofit with more pressing financial need would be more inclined to manipulate their 


5 For example, SFAS No. 117 states that donations must not be netted against outside fundraising costs. If a 
nonprofit obtains all of its donations from outside fundraising firms, prior to SFAS No. 117 they could have 
correctly reported net donations (i.e., net of fundraising costs paid to the outside fundraising firm). However, 
after SFAS No. 117 the nonprofit must report gross donations and any associated fundraising costs. This rule 
change could have an impact on the propensity to report fundraising expenses on IRS Form 990. 

We chose to use a dichotomous variable rather than a continuous one (1.е., payment to outside accountant) 
because the relevant partition is whether or not the organization hires an outside accountant at all, and not on 
how much it pays that accountant. Certainly larger firms will pay their outside accountants more, but this is not 
necessarily an economically meaningful relationship. Nonetheless, we tested the robustness of our results to the 
inclusion of the continuous variable (payment to accountant) with no change in the tenor of the results. 
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reported financial ratios so as to garner additional donations. However, there are several 
difficulties with empirically measuring the concept of financial distress. First, we recognize 
that it is difficult to capture timing effects. If an organization in financial distress manipu- 
lates its financial information and then receives additional donations in the subsequent 
period, then in those subsequent periods there would not necessarily be a relationship 
between financial need and reporting choices as the financial need would ostensibly be 
overcome. In an attempt to mitigate this timing effect we use a lagged value of financial 
need, although this is an incomplete method as the number of lags is unknown (we do not 
know how long ago the organization first manipulated its financial reports as a response to 
financial distress). Second, it is not clear how one would measure the construct of financial 
need in the nonprofit setting. To this end we rely on a working paper by Keating et al. 
(2005) who find that the ratio of net assets to total assets is a predictor of nonprofit financial 
distress measured as either insolvency risk (negative net equity) or financial disruption risk 
(а 25 or more percent decline in net assets).!" Because of the theoretical problems discussed 
above we estimate our model separately with and without the financial need metric to ensure 
that our primary results are not affected by the inclusion of the financial need variable. 

Additional control variables include Size (total assets from line 59b of the IRS 990 
divided by one million) and a trend variable equal to 1 in 1985, 2 in 1986 and so on. The 
trend variable is intended to control for increases in nonprofit financial reporting quality 
across time, ensuring that the accounting change variables are independent of general in- 
creases in nonprofit reporting quality over time. The results are not sensitive to the inclusion 
of additional control variables such as total donations, total revenues, and the square of 
size.!? 


Empirical Model 
Our empirical model takes the general form: 


Misreported Expenses, = а + B,Compensation Sensitivity, 
+ B,Donations Sensitivity, + B4Outside Accountant, 
+ B4,SFASI16/117, + BSOP 98-2, + Везе; 
+ B,Trend,, + В,Еталсїа! Need, | + Ep (3) 


To ensure that our statistical inferences are not influenced by the effects of using panel 
data, which has the tendency to deflate the standard errors of the regression estimates and 
increase the chance of finding significant relationships, we estimate our models using two 
methods that control for the effects of non-independence across time. First, we cluster the 
standard errors at the firm level. Second, we use annual Fama-Macbeth regressions, where 
statistical significance is based on the distribution of the annual coefficient estimates. As 
noted by Petersen (2004), each of these methods has its relative strengths and weaknesses. 


17 Keating et al. (2005) find that several other ratios are also indicators of nonprofit financial distress. We used 
each of their measures and found mixed results. While some of their financial distress measures are associated 
with the probability of reporting zero fundraising expenses (ie., net assets/total assets and net assets/total 
revenues), others are not (i.e.. net income/total revenue and the annual change in net income). 

18 In Table 3 we find that compensation and donations are related to changes in program spending that arise from 
changes in revenues and changes in program ratios. While managers may seek to increase their revenue so as 
to garner higher compensation and donations, the link between this incentive and expense misreporting is not 
clear, hence we exclude these incentive variables (the B,s from Equations (1) and (2)) from our incentive tests. 
However, the results of estimating Equation (3) are robust to including these additional variables. 


The Accounting Review, March 2006 


Expense Misreporting in Nonprofit Organizations 415 


The Fama-MacBeth approach is more standard in the accounting and finance literatures, 
but it does not allow us to include the variables for accounting rule changes, whereas the 
corrected standard error approach does. We present our clustered standard error results as 
they allow us to use a more complete model, and discuss our Fama-MacBeth results. 

Our methodology involves two stages; first, we estimate our sensitivity measures by 
regressing compensation or donations on the program ratio, and second, we test for incen- 
tive effects by regressing expense misreporting (as a zero fundraising expense indicator) 
on our sensitivity measures. Because program expenses appear in the first stage and are 
related to fundraising expenses (total expenses are the sum of program, fundraising, and 
administrative expenses), while fundraising expenses appear in the second stage (as the 
dependent variable), it might appear that our sensitivity variables are mechanically corre- 
lated with fundraising introducing endogeneity bias into our results. For example, if a 
nonprofit systematically reports zero fundraising expenses (our dependent variable in equa- 
tion 3), then it is likely to have systematically higher program expenses (a variable used in 
Models 1 and 2.to derive the sensitivity variables). However, because we estimate our 
sensitivity metrics using changes at the firm level in the first stage (Equations (1) and (2), 
our variables are free of any mechanical relationship with the dependent variable in the 
second stage (i.e., Equation (3)). Even if a nonprofit systematically overstates its program 
expenses by reporting zero fundraising expenses, any percentage changes in its reported 
program expenses, holding fundraising expenses a constant at zero, will be free of me- 
chanical bias.!? 

Table 4 contains the descriptive statistics for the incentive analysis variables. The de- 
pendent variable, Misreported Expenses, is an indicator variable equal to 1 if the organi- 
zation reported zero fundraising expenses, and 0 otherwise. We use this measure of expense 
misreporting based on the results earlier in the analysis, which provided evidence that many 
of the firms that report zero fundraising expenses are under-reporting those expenses. Cer- 
tainly this metric is not without error, as undoubtedly many firms that report some fun- 
draising expenses on their 990 under-report those expenses and it is possible that some 
firms that report zero fundraising on their 990 truly incur zero fundraising expenses. In 
both of these cases we misclassify our dependent variable causing measurement error. 
However, unless this error in the dependent variable is correlated with our independent 
variables our results will be unbiased. As shown in Table 4, approximately 50 percent of 
our sample reports zero fundraising expenses, despite earning public donations. The com- 
pensation and donations sensitivity variables are discussed earlier in this section and re- 
ported in Table 3. The statistics of the incentive variables are slightly different across Tables 
3 and 4 because the sample in Table 4 is limited to those observations with usable data for 
all of the regression variables used in Equation (3). Approximately 82 percent of our ob- 
servations hire an outside accountant, while the average nonprofit is 36 years old. 

Next we discuss the results of estimating Equation (3). Because the underlying distri- 
bution of some of the variables appears to be skewed, we also estimated our models using 
ranks with no change in the statistical inferences of the results. 


Regression Results 
Table 5 contains the results of the analysis for the full sample as well as industry 
partitions (i.e., charitable, educational, and medical). Model 1 of Table 5 reports results for 


19 To ensure that our results are not affected by observations that report zero fundraising in some years and positive 
amounts in others, we re-estimated our results on the sample of observations that report positive or zero fun- 
draising in all sample years, with no change in the inferences of the study. 
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TABLE 4 
Descriptive Statistics Analysis of Characteristics Associated with 


Understated Fundraising Expenses 


- First Third Standard 
Variable ____ | Mean Оше Median Quartile || Deviation 
Misreported Expenses 0.50 0.00 1.00 1.00 0.50 
Compensation Sensitivity 0.42 0.00 0.09 0.39 0.97 
Donations Sensitivity 0.94 0.00 0.09 0.55 2.94 
Outside Accountant 0.82 1.00 1.00 1.00 0.38 
SFAS 116/117 0.54 0.00 1.00 1.00 0.50 
SOP 98-2 0.30 0.00 0.00 1.00 0.46 
Size 0.95 0.17 0.42 0.93 2.43 
Trend 9.88 7.00 10.00 13.00 3.83 
Financial Need 0.72 0.54 0.82 0.96 0.28 
Age 36.01 20.98 36.88 50.91 17.57 


The sample includes the 5,780 observations in the NCCS database years 1985 to 2001 that have non-missing 
values for all variables. 
Variable Definitions: 
Misreported Expenses = equal to 1 if the organization reported zero fundraising expenses, and 0 ctherwise; 
Outside Accountant = equal to 1 if any fees were paid to outside accountants (a non-zero amount on line 
31a IRS 990), and 0 otherwise; 
SFAS 116/117 = an indicator is equal to 1 post-1994, and 0 otherwise; 
SOP98-2 = an indicator is equal to 1 post-1997, and О otherwise; 
Size = total assets (in $100 millions); 
Trend = a time trend variable equal to 1 in 1985, 2 in 1986, and so on; and 
Financial Need = lagged ratio of net assets to total assets. 
Compensation and Donations Sensitivity variables are described and derived in Table 3. 


the full sample (using clustered standard errors) and shows that the probability cf reporting 
zero fundraising expenses is increasing in the relative sensitivities of the program ratio to 
both officer’s compensation and donations. This finding provides support for our agency- 
based predictions and is consistent with the notion that nonprofit managers irtentionally 
manipulate their publicly available financial information so as to attract higher compensa- 
tion and donations. We also find that the use of an outside accountant reduces the probability 
that a nonprofit will report zero fundraising expenses, consistent with the use of an outside 
paid accountant increasing the reliability and usefulness of nonprofit financial reports. Fi- 
nally, we find that although SOP 98-2 reduces the probability of a nonprofit reporting zero 
fundraising expenses, the enactment of SFAS No. 116/117 appears to have no effect on 
misreported fundraising expenses. This finding is consistent with the results of Jones and 
Roberts (2006), who document that SOP 98-2 had an effect on how nonprofits report the 
results of combined fundraising and educational campaigns. 

Models 2, 3, and 4 in Table 5 show little variation in these general results across the 
three primary nonprofit industries (1.е., charitable, educational, and medical nonprofits). 
The only exception to the full sample is that the use of outside paid accountants has no 
effect on expense misreporting by charitable nonprofits (although the sign of the coefficient 
is in the correct direction). Model 5 includes the financial need variable for the full sample 


2 Educational nonprofits include primary and secondary nongovernmental schools, medical nonprofits include 
primary care hospitals, and charitable nonprofits include all other types of charities. 
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TABLE 5 
Regression Analysis of Characteristics Associated with Understated Fundraising Expenses 
(t-statistics Clustered by Firm) 
Misreported Expenses, = а + B,Compensation Sensitivity, + B Donations Sensitivity, 
+ B,Outside Accountant, + B,SFAS No. 116/117, 
+ ВЗОР 98-2, + BgSize, + B,Trend,, 
+ ByFinancial Need; , + Ep 


Column 

E ыы mE Е: с. за eee 

Sample Composition Full Sample Charities Educational Medical Full Sample 
Intercept —0.1887 —0.0965 — 1.6028 1.5012 0.5581 
(0.4058) (0.7926) (0.0003) (0.0095) (0.0695) 
Compensation Sensitivity 0.6772 0.4834 1.3999 1.1952 0.6088 
| (<.0001) (0.0128) (0.0001) (0.0045) (<.0001) 
Donations Sensitivity 0.8470 0.8786 1.2236 0.9260 0.8061 
(<.0001) (<.0001) (0.0010) (0.0015) (<.0001) 
Outside Accountant —0.3607 —0.0857 —1.0303 —0.8998 -0.2994 
(0.0464) (0.7669) (0.0047) (0.0460) (0.0974) 
5ЕА5116/117 0.0291 —0.0048 —0.1116 0.3012 0.0026 
(0.7100) (0.9710) (0.4421) (0.1371) (0.9742) 
SOP98-2 —0.2293 —0.3123 —0.6678 —0.5252 —0.2090 
(0.0043) (0.0151) (0.0042) (0.0342) (0.0061) 
Size -0.0744 -0.1476 -0.0778 -0.3082 -0.0754 
(0.0121) (0.1541) (0.0364) (0.0035) (0.0350) 
Trend -0.0024 -0.0360 -0.0379 0.0305 0.0073 
(0.8954) (0.2014) (0.3303) (0.5074) (0.7055) 
Financial Need —1.1289 
(<.0001) 

Observations | 5,771 2,226 1,677 1,719 5,655 


The sample includes the 5,780 observations in the NCCS database years 1985 to 2001 that have non-missing 
values for all variables. 
Two-tailed p-values are in parentheses. 
Variable Definitions: 
Misreported Expenses = equal to 1 if the organization reported zero fundraising expenses, and 0 otherwise; 
Outside Accountant = equal to 1 if any fees were paid to outside accountants, and 0 otherwise; 
SFAS116/117 = an indicator is equal to 1 post-1994, and 0 otherwise; 
SOP 98-2 = an indicator is equal to 1 post-1997, and 0 otherwise; 
Size = total assets (in $100 millions); 
Trend = a time trend variable equal to 1 in 1985, 2 in 1986, and so on; and 
Financial Need = the lagged ratio of net assets over total assets. The educational sample includes private 
universities and secondary schools, the medical sample includes primary care hospitals, 
and the charity sample includes all other types of nonprofits. 
Compensation and Donations sensitivity variables are described and derived in Table 3. 


and shows that, as suspected, nonprofits with more pressing financial needs (as evidenced 
by a lower ratio of net assets to total assets) are more likely to report zero fundraising 
expenses (i.e., a negative coefficient suggests that a lower ratio of net assets to total assets 
is associated with a higher probability of reporting zero fundraising expenses). 
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We estimated similar tests using the Fama-MacBeth method where coefficients are 
averaged across annual regressions and the corresponding t-statistics are based on the dis- 
tribution of the annual coefficients. The inferences of the results (untabulated) using the 
Fama-MacBeth method are identical to those in Table 5. The consistency and robustness 
of our results provides comfort that they are not driven by intemporal correlations within 
firms. 

We also estimated the models using the Fama-MacBeth approach for various size and 
age groupings. The groupings are the lower quartile, middle half, and upper quartile group- 
ings of size and age. We group on these variables because it is possible that the effects of 
the independent variables on financial statement management could vary across size and 
age in ways that are not fully captured by simply including an age or size repressor. For 
example, it is possible that smaller nonprofits are inherently less interested in the quality 
of their financial reports, or they lack the necessary expertise, either hired from ће outside 
or employed on the inside, to prepare correct IRS 990s. Our results (untabulated show that 
our results hold across size and age partitions, suggesting that neither size nor age drives 
the overall results. Results are similar using the clustered standard error methoc. 

To summarize the results of this portion of our analysis, our findings are consistent 
with agency theory and suggest that nonprofits that have compensation and resource (i.e., 
donation revenues) incentives to manage their financial reports respond to those incentives. 
Our empirical analyses are consistent even when we use a variety of statistical models, 
sample partitions, and various definitions of the incentive variables. The finding that non- 
profit managers respond to incentives to manage their publicly available financial infor- 
mation has important policy implications as it suggests that nonprofits are providing donors 
and regulators with managed financial information that could possibly lead to incorrect 
donations allocations and governing decisions. 


V. CONCLUSIONS 

IRS Form 990 is the principle source of financial information for nonprofits, and the 
relative allocation of expenses among the categories on the IRS 990 is of primary interest 
to donors and other stakeholders. A growing body of anecdotal evidence notes the fact that 
over 50 percent of all nonprofits that receive donations also report zero fundraising ex- 
penses. It is widely suspected that this behavior is inappropriate, and perhaps even inten- 
tional, as a nonprofit can increase the amount of program expenses it reports to the public 
on its IRS 990 by shifting expenses from fundraising to programs. Our analysis provides 
empirical evidence supporting the notion that nonprofits inappropriately and perhaps even 
intentionally misstate their relative proportion of expenses in these three categories. 

We provide evidence suggesting that many nonprofits reporting zero fundraising ex- 
penses do, in fact, undertake fundraising activities. Further analysis shows that expense 
misreporting is associated with managerial incentives to garner larger amounts of mana- 
gerial pay and donations. 

Our findings have important implications for public policy to the extent that donors 
and other stakeholders allocate their resources among nonprofits or make regulating deci- 
sions based on their reported financial results. Our results also have important implications 
for nonprofit researchers as they suggest that organizations that report zero fondraising 
expenses should be carefully considered. Depending on the tvpe of analysis and question 
being asked it might be prudent for a researcher to exclude organizations from their analysis 
that report zero fundraising expenses. 
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ABSTRACT: Prior research. shows that during the pre-1990 bust financially weak banks 
managed income upward by delaying provisions for losses on heterogeneous loans. 
In contrast, we predict and find that during the 1990s boom profitable banks managed 
Income downward by accelerating provisions for losses on homogeneous loans. Prof- 
КаЫе banks obscured their income smoothing by accelerating charge-offs of homo- 
geneous loans and by recording more gross charge-offs to offset recoveries of previ- 
ously charged-off loans. Over the three years subsequent to the acceleration of 
charge-offs, they had higher and more persistent Income before provisions for loan 
losses than other banks, consistent with Income smoothing over a prolonged horizon. 


Keywords: income smoothing; business cycle; banks; provisions for loan losses; loan 
charge-offs. 


Data availability: АЛ data are available from public sources. 


L INTRODUCTION 
e investigate how commercial banks’ management of provisions for loan losses 
and loan charge-offs changed from their period of poor health and ultimately 
crisis (bust) from 1974—1990 to their period of robust health (boom) throughout 
the 1990s. A considerable body of prior research finds that during the pre-1990 bust banks 
delayed provisioning for loan losses, with this effect being stronger for financially weaker 
banks holding more heterogeneous loans. In contrast, we predict and find that during the 
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1990s boom banks accelerated provisions and made this less apparent by also accelerating 
charge-offs, with these effects being stronger for more profitable banks holding more ho- 
mogeneous loans. 

As in Liu and Ryan (1995), we exploit differences in how provisions and charge-offs 
are determined for heterogeneous and homogeneous loans under Generally Accepted Ac- 
counting Principles and related accounting practices. For heterogeneous (e.g., commercial) 
loans, provisions and charge-offs typically are determined with judgment on a loan-by-loan 
basis. For homogeneous (e.g., consumer) loans, provis:ons are determined using statistical 
methods at the portfolio level, while charge-offs are determined using number-of-days-past- 
due rules. Liu and Ryan (1995) predict and find that banks had more ability to delay 
provisions for heterogeneous than homogeneous loans during the pre-1990 bust. In contrast, 
we predict and find that banks had more ability to accelerate charge-offs for homogeneous 
than heterogeneous loans during the 1990s boom. 

We focus on banks’ acceleration of charge-offs of homogeneous loans to Өвөө their 
income smoothing, a previously undocumented discretionary behavior, not on the strategic 
reasons for that behavior. Numerous studies argue convincingly that banks have incentives 
to smooth their income, and in particular to store incame in booms such as the 1990s to 
cushion income in subsequent busts. Consistent with these incentives, most of the numerous 
empirical studies investigating this question—on samples drawn wholly or largely from the 
pre-1990 bust—find that banks do smooth their income using provisions for loan losses. 
Building on this prior research, we posit that banks prefer smoother income. 

We also posit that, in smoothing their income, banks prefer that their allowances for 
loan losses do not fluctuate too much, in order to avoid scrutiny by bank regulators, the 
SEC, auditors, and market participants. The primary way that banks can reconcile these 
preferences is by exercising discretion over loan charge-offs, for example, by accelerating 
charge-offs when the allowance for loan losses would otherwise be too high. Bank regu- 
lators and analysts facilitate this discretionary behavior by using the ratio of the allowance 
for loans losses to net loan charge-offs to assess the adequacy of banks' allowances for 
loan losses. 

We test and find support for four specific hypotheses about banks' provisioning and 
charge-off behavior during the 1990s boom. First, we predict and find that more profitable 
banks holding more homogeneous loans exhibit a more positive association between the 
provision for loan losses and income before the provision for loan losses, consistent with 
greater income smoothing by these banks. Second, we predict and find that more profitable 
banks charging off more homogeneous loans recover -hose charge-offs at a higher rate, 
consistent with greater acceleration of charge-offs by these banks. Third, we predict and 
find that more profitable banks recovering more charge-offs of homogeneous loans in a 
given year record more gross loan charge-offs in that year, consistent with these banks 
making new charge-offs to keep their allowances for loan losses from ballooning. Fourth, 
we predict and find that banks charging off more homogeneous loans had higher and more 
persistent income before the provision for loan losses over the next three years than did 
other banks, consistent with these banks smoothing income over a prolonged horizon. 

Collectively, our results strongly suggest that profitable banks managed their income 
downward and accelerated charge-offs of bomogeneous loans to obscure this discretionary 
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behavior over the prolonged 1990s boom.' Our paper is the first to provide evidence of 
banks’ management of loan charge-offs for the purposes of income smoothing; prior re- 
search typically treats charge-offs as a nondiscretionary control variable, perhaps because 
charge-offs do not directly affect income. It is also the first to provide evidence of banks’ 
.management of provisions and charge-offs for homogeneous rather than heterogeneous 
loans in boom times. This evidence adds to our understanding of the range and determinants 
of banks’ discretionary behavior. 

Our paper also contributes to the broader area of research on income smoothing. As 
discussed by Dechow and Skinner (2000), a debate remains in this literature—especially 
between academics and practitioners—as to whether, when, and how income smoothing 
occurs. In our view, a primary reason for the lack of resolution of this debate is that income 
smoothing is a more contextual and less purely symmetrical behavior than is captured by 
the usual research designs. We provide an example of such behavior, showing that banks 
manage income downward in boom times differently from how they manage income upward 
in bust times. 

The remainder of this paper is organized as follows. Section II develops our hypotheses 
based on the rules and practices governing the accounting for loan losses and prior research. 
Section III describes the sample and reports descriptive statistics. Section IV reports the 
empirical results. Section V concludes. 


П. HYPOTHESIS DEVELOPMENT 
. Accounting for and Disclosures of Loan Losses 

Under FAS No. 5 (FASB 1975), Accounting for Contingencies, when credit losses on 
a loan or portfolio of homogeneous loans are probable and can be reasonably estimated, 
an expense called the provision for loan losses and a contra-asset (to loans outstanding) 
called the allowance for loan losses should be recorded. When a specific loan is deemed 
uncollectible, it should be charged off, which involves reducing both loans outstanding and 
the allowance for loan losses by the uncollectible amount. Charge-offs do not directly affect 
total assets, net loans outstanding, owners' equity, or net income. Charged-off loans may 
turn out to be collectible, in which case the charge-offs are reversed and called recoveries. 
Net loan charge-offs equal gross loan charge-offs less recoveries. 

Banks are required to disclose nonperforming assets, which includes the entire book 
value of loans more than 90 days past due, restructured troubled debt, and repossessed real 
estate. Nonperforming assets is a relatively nondiscretionary and timely source of infor- 
mation about loan default, although it does not reflect either the probability of default or 
expected losses given default. 


Banks? Discretion over Provisions for Loan Losses 
The Effect of the 1990s Boom 

This study was motivated in part by our belief that banks saw their income increase in 
the early 1990s and immediately began overstating provisions for loan losses, initially to 
undo past under-reserving, but fairly quickly (no later than 1994) over-reserving. Consistent 
with this belief, from no later than October 1994 throughout the remainder of the 1990s 
boom, various governmental and regulatory bodies repeatedly expressed concern that banks 


! While not directly related to this paper, Moyer (1990), Beatty et al. (1995), and Collins et al. (1995) make 
hypotheses about and generally find evidence of the management of regulatory capital using net loan charge- 
offs. Loan charge-offs reduce regulatory capital to the extent that the allowance for loan losses is included in 
regulatory capital. The allowance for loan losses was included in primary capital up to 1990 and subsequently 
is included with limits in Tier 2 capital. 
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were over-reserved for loan losses and thereby smoothing their income across the business 
cycle. Most notably, in an October 1994 report, the General Accounting Office states that 
a sample of “institutions maintained significant amounts of unsupported reserves ... not 
clearly linked to likely losses" (U.S. GAO 1994). In November 1998, the SEC required 
SunTrust Banks—which reported a high level of gross charge-offs that it reccvered at a 
rate close to 40 percent from 1993-1997--40 reduce its allowance for loan losses by $100 
million. The public nature of this requirement was intended as a signal to all banks either 
to justify or reduce their allowances.? In January 1999, the SEC sent a form letter to selected 
bank holding companies suggesting specific enhanced disclosures regarding their allowance 
and provision for loan losses. In July 2001, the SEC staff issued SAB No. 102, Selected 
Loan Loss Allowance Methodology and Documentation Issues, which requires “а system- 
atic methodology to be employed each period in determining the allowance for loan losses 
to be reported," emphasizing that this methodology should be “consistently applied” 
through time. The major bank regulators contemporaneously revised their regulatory guid- 
ance to be consistent with SAB No. 102. 


Prior Research 

Prior research on banks’ management of provisions for loan losses falls into three 
distinct literatures. The first argues that banks have incentives to decrease income in good 
times such as the 1990s boom to allow increased income in subsequent bad times. Joyce 
(1996) argues that banks manage provisions for compensation purposes. Galai et.al. (2003) 
argue that banks create “hidden reserves” that they release as needed either to raise income 
to meet analyst forecasts or capital requirements. Kanagaretnam et al. (2003) hypothesize 
that banks that smooth income downward are more likely to obtain financing externally. 

The second literature examines empirically whether banks manage provisions for loan 

losses to smooth income. Most, though not all, of these studies find evidence consistent 
_ with income smoothing.’ Kanagaretnam et al. (2003) find that downward income smoothing 
is stronger for banks that are more likely to raise funds externally. Beatty et al. (2002) find 
that publicly traded banks are more likely than privately held banks to manage earnings 
upward to avoid earnings decreases. 

The third literature examines the market pricing of the allowance or provision for loans 
losses.* All of these studies are motivated at least in part by the idea that banks exercise 
discretion over provisions for loan losses, with several advancing the hypothesis -hat banks 
do so to signal their quality? A number of these studies examine the market's reaction to 
the March-April 1987 announcements of provisions for less-developed-country debt for 
which banks had clearly exercised discretion to delay making those provisions, given that 
much of this debt was known to be troubled since the early 1980s.° 


2 The “SunTrust Banks—After the Restatement” case in Ryan (2002, Chapter 5) examines this situation in detail. 

* Greenawalt and Sinkey (1988), Ma (1988), Collins et al. (1995), Robb (1998), Beatty et al. (2002), Kanagaretnam 
et al. (2003), and Gray (2004) provide evidence that banks smooth income using provisions for loan losses, 
while Scheiner (1981), Beatty et al. (1995), and Ahmed et al. (1999) do not. 

4 See Beaver et al. (1989), Wahlen (1994), Liu and Ryan (1995), Beaver and Engel (1996), Liu et al. (1997), 
Ahmed et al. (1999), and the papers on less-developed-country debt cited in footnote 6. 

5 Beaver et al. (1989), Musumeci and Sinkey (1990), Grammatikos and Saunders (1990), Wahlen (1994), and Liu 
et al. (1997) advance the signaling hypothesis. 

$ See Grammatikos and Saunders (1990), Musumeci and Sinkey (1990), Elliott et al. (1991), and. Griffin and 
Wallach (1991). Poland and other Eastern bloc countries had problems repaying debt in the early 1980s and 
Mexico and Brazil announced moratoria repaying interest and principal in late 1982. Boehmer anc Megginson 
(1990) report that the average market price for less-developed-country deb: in January 1986 ranged from 27.5 
cents on tbe dollar for Peru to 83 cents on the dollar for Columbia. 
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Effects of Loan Portfolio Composition 

Liu and Ryan (1995) argue that, under Generally Accepted Accounting Principles and 
related accounting practices, banks have differential ability to exercise discretion over pro- 
visions for different types of loans. For homogeneous (e.g., consumer) loans, provisions 
are determined primarily on a statistical basis at the portfolio level. For heterogeneous (e.g., 
commercial) loans, provisions are determined primarily on a judgmental basis at the 
individual-loan level. During bust periods when banks have incentives to delay provisions 
for loan losses, the statistical approach should yield timelier and less discretionary provi- 
sions for homogeneous loans than the judgmental approach yields for heterogeneous loans. 
Consistent with this argument, for a sample of banks during the pre-1990 bust, Liu and 
Ryan (1995) find that provisions for loan losses were highly untimely compared to non- 
performing assets for banks that held above-median heterogeneous loans, but relatively 
timely for banks that held above-median homogeneous loans. 

In contrast, we expect discretionary acceleration of both provisions and charge-offs for 
homogeneous loans during the 1990s boom. Homogeneous loans typically are charged off 
based on number-of-days-past-due rules, while heterogeneous loans typically are charged 
off based on judgment on a loan-by-loan basis. While the use of number-of-days-past-due 
rules may suggest that less discretion is applied to charge-offs of homogeneous than het- 
erogeneous loans, this turns out not to be true when banks desire to accelerate charge-offs 
in boom times, for three reasons. 

First, these rules interact with economic conditions, yielding charge-offs that are re- 
covered with much higher probability in good than bad times. While this could be viewed 
as a mechanical rather than discretionary aspect of these rules, in principle banks could 
and should adjust the number of days past due they use in these rules to remove this effect. 
In practice, however, they do not appear to do so, perhaps reflecting McNichols and 
Wilson's (1988) observation that as "receivables turnover decline (for example, in bank- 
ing), mechanical rules are more difficult to specify, leaving greater room for discretion over 
write-offs." 

Second, and more importantly, at least until bank regulators changed the guidance for 
charge-offs of consumer loans effective June 1999, as discussed below, banks employed 
discretion in determining and applying their policies regarding the number of days past due 
to charge off homogeneous loans, leading to time-series and cross-sectional variation in 
those policies.” To curb diversity across banks in charge-off practices for homogeneous 
loans, in February 1999 the Federal Financial Institutions Examination Council (FFIEC, a 
cooperative group of the main bank regulators) revised guidance for charge-offs of con- 
sumer loans that had been in place since 1980, stating that “[p]revious policy guidance had 
been interpreted and applied inconsistently,’ “[c]harge-off practices ranged from 120 to 


? Until pressured by the SEC to do so in late 1998 (via the SunTrust restatement) and early 1999 (via the sample 
letter sent to bank holding companies [SEC 1999]), banks rarely disclosed their charge-off policies in their 
public filings. Hence, during our sample period, it is generally not possible to determine when and how banks 
changed their charge-off policies. We determined that a fair number of such changes did occur by conducting 
a web search using "charge-off policy" and “change.” For example, Standard & Poor's Creditwire explained 
the change in charge-off rates for an index of credit card-backed securities from June to July 1997 as being 
significantly affected by First USA's decision to charge off credit card accounts at 180 days past due rather than 
210 days past due it had used previously. 
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240 days" past due, and “a number of institutions were not following existing policy 
guidance for charging off open-end accounts based on past due status." ? 

Third, charge-offs of heterogeneous loans typically are based on loan-by-Ican evalua- 
tions by loan officers, whose incentives generally work against acceleration of charge-offs 
and whose decisions are harder to coordinate. Office of the Comptroller of the Currency et 
al. (1993) states that “ап effective credit grading framework generally places primary re- 
liance on loan officers to identify emerging loan problems." However, in a 2001 speech, 
Federal Reserve Board Chairman Alan Greenspan stated “regrettably, incentives for loan 
officers ... traditionally have rewarded loan growth, market share, and the profits that derive 
from booking interest income with, in retrospect, inadequate provisions for possible de- 
fault." Udell (1989) and Berger and Udell (2002) discuss and provide evidence about loan 
officers' informational advantages in monitoring loans and their incentive to hide loan de- 
fault on the loans they originated. They argue that loan officers devote insufficient time to 
monitoring existing loans, because their compensation is based disproportionatelv on short- 
term revenues. Berger and Udell (2003) emphasize that this time allocation issue and other 
incentive problems are more likely to arise in booms, such as the 1990s, when -he oppor- 
tunities for loan growth are highest and when realized loan default has been low for ex- 
tended periods. 


Hypotheses 

Building on the prior research discussed above, we posit that banks prefer smoother 
income, all else being equal. During the 1990s boom, smoothing usually involved banks 
reducing income, as evidenced by the actions of the governmental and regulatory bodies 
described above. We also posit that, in smoothing their income, banks prefer that their 
allowances for loan losses do not fluctuate too much, in order to avoid scrutiny by bank 
regulators, the SEC, auditors, analysts, and other market participants. We hypothesize that 
banks satisfy both these preferences by exercising discretion over loan charge-offs. During 
the 1990s boom, this usually involved banks accelerating loan charge-offs to reduce the 
allowance for loan losses. 

While banks' loan charge-offs are observable, this discretionary behavior likely is 
opaque to outsiders for the following two reasons. First, charge-offs can be volatile at the 
bank level even in good times, and discretionary charge-offs are not directly observable. 
Hence, the exercise of discretion can appear to be attributable to chance, at least for a 
while. Second—and more importantly given the prolonged period over which we hypoth- 
esize that banks smoothed income—bank regulators and analysts facilitate this discretionary 
behavior by using the ratio of tbe allowance for loans losses to net loan charze-offs to 
assess the adequacy of banks' allowances for loan losses.? Increasing net charge-offs lowers 
this ratio by both reducing its numerator and increasing its denominator, so a bank that 
that increases its charge-offs appears to have a less excessive allowance for loan losses. 

We test four hypotheses about how more profitable banks smoothed their income down- 
ward during the 1990s boom by accelerating provisions for and charge-offs of homogeneous 


8 See FFIEC (1999) February 10, 1999. In this guidance, the FFIEC requires credit card and other open-end 
consumer loans to be charged off no later than 180 days past due and closed-end consumer (e.g., auto) loans 
to be charged off no later than 120 days past due as of June 1999. These new rules do not eliminate differences 
in charge-off practices across banks, however, because they allow banks to adopt a “policy more zonservative 
than the one detailed." For example, SunTrust Banks continues to charge off closed-end consumer loans at 90 
(not 120) days past due, despite consistently recovering these charge-offs at about a 40 percent rate. 

9 See Federal Reserve Board (2005, Section 2072.1, 13), Office of the Comptroller of the Currency (1998, 38), 
and Ryan (2002, Chapter 5). 
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loans for which charge-offs are determined using number-of-days-past-due rules and so can 
be more easily accelerated. First, consistent with most prior empirical research, which finds 
that banks smooth their income, we hypothesize that banks' provisions for loan losses are 
positively associated with income before the provision for loan losses. Extending this prior 
research, we hypothesize that this association is stronger for more profitable banks that 
hold more homogeneous loans, consistent with greater income smoothing by these banks. 

Second, we hypothesize that more profitable banks that charged off more homogeneous 
loans had higher rates of recoveries of charge-offs, consistent with these banks accelerating 
charge-offs of these loans. As discussed above, recovery rates are used by bank regulators 
and analysts to assess the aggressiveness of banks’ charge-off policies.!? 

Third, we hypothesize that more profitable banks that recovered more charge-offs of 
homogeneous loans in a given year recorded more new gross loan charge-offs in that year, 
consistent with these banks reducing their allowances for loan losses in order to obscure 
their income smoothing across the prolonged 1990s boom. 

Fourth, we predict and find that profitable banks that charged off more homogeneous 
loans have higher and more persistent income before the provision for loan losses over the 
next three years, consistent with these banks smoothing income over a prolonged horizon. 


HL SAMPLE AND DESCRIPTIVE STATISTICS 

The sample covers the period from 1991—2000, which includes the entire 1990s boom. 
The number of commercial bank holding companies in the sample rises from 122 in 1991 
to 304 in 2000, peaking at 329 in 1999. Since the sample does not include the bust period 
up to 1990, we contrast our results to those of the prior research summarized in Section П 
to make inferences about changes over time in banks' management of provisions for loan 
losses and loan charge-offs. While this is not optimal, we view it as preferable to attempting 
to construct a sample of banks that includes the bust period for the following reasons. First, 
the bank regulatory data discussed below becomes available only in 1986, and its definitions 
of variables change at the beginning of our sample period in significant ways. In particular, 
the data by loan type discussed below became available in its current form either at the 
end of 1990 or in the first quarter of 1991. Second, Bank Compustat does not include non- 
surviving banks, and so yields a highly selected sample in the bust period. Third, there are 
nontrivial problems matching banks over a longer period given the volume of mergers and 
acquisitions in the banking industry. 

We obtain the following data by type of loan from commercial bank holding companies’ 
regulatory Y-9C reports available on the Federal Reserve Bank of Chicago’s website: loans 
outstanding, gross and net loan charge-offs, recoveries, and nonperforming assets. As in 
Liu and Ryan (1995), homogeneous loans include consumer loans, 1-4 family residential 
mortgages, loans to financial institutions, and acceptances of other banks. Heterogeneous 
loans include commercial and industrial loans, direct lease financing, all other real estate 
loans, agriculture loans, and foreign loans. We also obtain the allowance and provision 
for loan losses, net income, total assets, and owners’ equity from these reports. 


10 See Federal Reserve Board (2005, Section 2072.1, 13), Office of the Comptroller of the Currency (1998, 17), 
and Ryan (2002, Chapter 5). 

!! Liu and Ryan (1995) use more aggregated data on loans outstanding by type from Bank Compustat to calculate 
homogeneous and heterogeneous loans. The main difference between our classification scheme and that of Liu 
and Ryan (1995) is our more descriptive classification of 1-4 family residential mortgages as homogeneous, 
whereas Liu and Ryan are constrained to classify all real estate loans in a single category, which they choose 
to be heterogeneous. This classification difference has minimal effect on the results in this paper. 
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We obtain Tier 1 risk-based capital ratios from 1993—2000 from Bank Compustat, 
because these ratios are only available beginning in 1997 in the Y-9C reports. Since Tier 
1 risk-based capital ratios are not available prior to 1993 on Bank Compustat, in those 
years we proxy for it using equity divided by assets. 

Table 1 reports descriptive statistics for the variables in the regression analyses reported 
in Section IV as well as additional descriptive variables. Panel A reports the means and 
quartiles of these variables for the entire sample period. 

Panel B of Table 1 reports medians of the variables for the odd years in the sample 
period. The trends in the medians from 1991 to 1995 indicate that by no later than 1994 
banks were experiencing low levels of loan losses and were far better reserved for loan 
losses than during the pre-1990 bust. Specifically, the downward trends for the provision 
for loan losses (PLL) and net loan charge-offs (NLCO) both deflated by beginning loans 
outstanding, indicate that loan losses dropped by about 65 percent from 1991 to 1995. The 
upward trends for the ratios of the allowance for loan losses (ALL) to nonperforming assets 
(NPA) and to NLCO indicate that loss reserve adequacy improved by about 150 percent 
over this period. The ratio of recoveries to gross loan charge-offs, REC%, referred to as 
the recovery rate, is a measure of the aggressiveness of banks’ charge-off policies used by 
bank regulators and analysts. The upward trend for REC% indicates that the recovery rate 
doubled from 1991 to 1995, rising to a level of 32.5 percent in 1995 that is difficult to 
reconcile with charged-off loans being deemed uncollectible. 

Reflecting banks’ improving health, their return on assets, ROA, rose almost 40 percent 
from 0.96 percent in 1991 to 1.33 percent in 1995 and stayed at about that level throughout 
the remainder of the 1990s. The upward trend for the Tier 1 risk-based capital ratio, CAP, 
reported in Panel B is correct but overstated, because the ratio of book equity to book 
assets proxies for this variable in 1991. Bassett and Zakrajsek (2001) report that the average 
Tier 1 risk-based capital ratio for all U.S. chartered commercial banks rose approximately 
30 percent, from .081 in 1991 to 0.106 in 1994, and Panel B indicates that our sample 
banks stayed well capitalized through 2000. These statistics illustrate a highly profitable 
and well-capitalized banking industry by no later than 1994, conditions that persist through 
2000. 

Panel C of Table 1 reports medians of REC%, for groups of observations more and 
less likely to accelerate loan charge-offs to obscure their income smoothing. Using the prior 
year’s values of variables, we group observations into above- and below-median ROA,_, 
groups, into quintiles based on total gross loan charge-offs divided by loans outstanding, 
GLCO,..,, and into quintiles based on each of gross loan charge-offs of homogeneous and 
heterogeneous loans divided by loans outstanding, GLCO(HOM),_, and GLCO(HET),_.,, 
respectively. Consistent with our hypothesis that more profitable banks with higher charge- 
offs of homogeneous loans are more likely to be accelerating loan charge-offs, for the 
above-median КОА, , group, REC%, rises sharply and monotonically from 21.6 percent in 
the lowest СІСО(НОМ), , quintile to 34.6 percent in the highest GLCO(HOM ), , quin- 
tile. In contrast, REC%, has no obvious trend across the GLCO(HOM), , quintiles for the 
below-median ROA, , group. Also in contrast, REC%, generally decreases across the 
GLCO(HET), , quintiles for the above-median ROA, , group, and it has no obvious trend 
across these quintiles for the below-median ROA, , group. 

Panel D of Table 1 reports Pearson correlations of these variables, which indicate that 
many of our explanatory variables—most notably, the provision for loan losses and net 
loan charge-offs—are highly correlated. While these correlations do not pose problems in 
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TABLE 1 
Descriptive Statistics*><“ 
Panel A: Means and Quartiles of Variables in Pooled Sample 
Variables = Mean Мі 2% Ма 759. Мах #Оһз 
X .0223 —.0117 0177 0216 1260 .0822 2148 
КОА .0127 —.0197 .0103 .0128 .0155 .0392 1886 
PLL :0048 —.0041 .0019 .0033 .0053 .0476 2148 
NLCO .0038 —.0026 .0011 .0023 .0044 .0417 2148 
REC .0013 .0000 .0005 .0009 .0026 .0127 2159 
REC96 .3575 .0029 .1614 .2558 4033 4.5223 2137 
АМРА —.0004 —.0612 —.0020 .0000 .0021 .0413 2148 
HOM% 4731 0361 .3633 4775 5959 .9708 2145 
ALL/NPA 2.9521 1921 1.0458 1.9008 3.3003 50.2022 2105 
ALL/NLCO 8.0720 —62.2102 3.0375 5.4541 10.4358 77.7020 1751 
CAP 1156 0414 40850 .1099 1370 3071 1736 
Panel B: Medians of Variables in Odd Years of Sample Period 
Variables 1991 1993 1995 1997 1999 
Х 0219 40232 40219 0225 ‚0204 
КОА 0096 0129 0133 0134 40124 
PLL ‚0075 0040 .0029 .0032 .0029 
NLCO | .0058 .0028 :0022 .0021 0018 
КЕС .0012 .0012 .0010 .0008 .0006 
REC% .1576 2912 3253 2648 2474 
ANPA 0014 —.0023 —.0003 ‚ .0002 .0003 
HOM% 4763 5085 5064 4854 4663 
ALL!NPA -7615 1.2770 1.9847 2.2635 2.4508 
ALL/NLCO 2.5246 5.3861 6.2509 5.8624 6.1907 
CAP 40730 .1139 .1122 .1100 .1051 
max #obs 122 141 197 252 329 
Panel C: Median Recovery Rate (REC 96,) by Profitability and Charge-Off Groups 
Quintiles 

КОА,_, Quintiles Based on Variables 1009) 2 3 4 || 5 (high) 
Above median | GLCO, , .2198 2101 .2370 .2646 .2658 

GLCO(HOM),_; 2158 2244 | 2579 3007 .3457 

GLCO(HET), , 3529 2663 2271 237 .2146 
Below median GLCO,, .2894 2960 .2832 .2707 .2607 

СІСО(НОМ),_ .2647 3563 .2467 3000 2560 

GLCO(HET),_, 2831 2717 2389 2850 .2563 


(continued on next page) 
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TABLE 1 (Continued) 
Panel D: Pearson Correlations of Variables in Pooled Sample 


Variables (29 (9 Ф о (9 о 68) (0 W Gap 
X (1) 90 32 26 17 -00 -05 -47 15 -406 -06 
ROA (2) -11 -06 10 06 -05 -10 22 05 -05 
PLL (3) 8633 27 -02 —10 —03 —23. —02 
NLCO (4) AY 533. —-18 -04 -06 S96: -04 
REC (5) 36  -23 -08 -0 -24  -04 
REC% (6) -02 -4 10 -04 44 
ANPA (7) 02 -00 01 02 
HOM®% (8) 00 143 —05 
ALLINPA (9) 18  .06 
ALL/NLCO (10) 73 
CAP (11) 


* All variables are collected from commercial bank holding companies’ regulatory Y-9C filings except for Tier 1 
risk-based capital, which is collected from Bank Compustat. The sample covers the period 1991—2000. 
> The descriptive statistics reflect all observations of any variable included in any subsequent regression analysis. 
Observations in the outside 0.5 percent of either tail of the distribution of any variable in a given analysis or 
for which total assets or loans outstanding on the regulatory Y-9C reports differs from the value on Bank 
Compustat by more than 10 percent are deleted. 
© Variable definitions: 
X = net income before the provision for loan losses divided by beginning of year total loans; 
ROA = net income divided by beginning of year total assets; 
PLL = provision for loan losses divided by beginning of year total loans; 
GLCO = gross loan charge-offs divided by beginning of year total loans; 
NLCO = net loan charge-offs divided by beginning of year total loans; 
REC = recoveries divided by beginning of year total loans; 
REC% = recoveries divided by gross loan charge-offs; 
ANPA = change in nonperforming assets divided by beginning of year total loans; 
HOM% = percentage of total loans outstanding that are homogeneous loans (consumer loans, 1—4 family 
residential mortgages, loans to financial institutions, or acceptances of other banks); 
ALL/NPA = allowance for loan losses divided by nonperforming assets; 
ALL/NLCO = allowance for loan losses divided by net loan charge-offs; 
CAP = Tier 1 risk-based capital ratio; and 
max #obs = the maximum number of observations of any variable in a given year (which typically exceeds 
the minimum number of observations of any variable in that year by 10—20 percent). 
4 A variable name followed by HOM or HET in parentheses denotes the value of that variable for homogeneous 
or heterogeneous loans, respectively. For example, GLCO(HOM) denotes gross loan charge-offs of 
homogeneous loans. 


our pooled estimations, they preclude reliable year-by-year estimations of the models, es- 
pecially in the early years in our sample period that have about one-third the observations 
of later years. 


IV. MODELS AND RESULTS 

Since our single-industry sample exhibits cross-sectional dependence, we estimate each 
regression model with fixed time effects. To mitigate heteroscedasticity, we deflate unscaled 
variables by beginning loans outstanding and delete observations of each variable in the 
outside 0.5 percent of each tail of its distribution each year. Our regression results are not 
sensitive to the treatment of outliers as long as extreme observations are deleted or pulled 
in; for example, deletion of influential outliers identified based on Belsley et al. (1980) 
diagnostics has minimal effect on our results. 
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Income Smoothing Using Provisions for Loan Losses 


In this section, we test the hypothesis that during the 1990s boom, banks’ provisions 
for loan losses are positively associated with their income before the provision for loan 
losses, consistent with banks managing provisions to smooth their income, and that this 
association is stronger for more profitable banks that hold more homogeneous loans. We 
conduct these tests using the following variant of a model developed by Ahmed et al. 
(1999):12 


PLL, = а + Ы, + CHOM%, + dX, + ЈУ ыд + 8(X*HOM%,) 
+ hCAP, + ЈАМРА, + e, (1) 


Thighroat 1$ ап indicator variable that takes a value of 1 if a bank has an above-median return 
on assets in year г, and 0 otherwise. HOM%, denotes the percentage of homogeneous loans 
in the bank's loan portfolio. X, denotes income before the provision for loan losses.!? 

The hypothesis stated above has three main implications for the coefficients in Equation 
(1). First, the coefficient on X, should be positive, consistent with banks smoothing their 
income using provisions for loan losses during the 1990s boom. Second, the coefficient on 
X,*L,,",; Should be positive, consistent with more profitable banks more aggressively 
smoothing income (downward) during this period. Third, the coefficient on X,*HOM®, 
should be positive, consistent with banks that hold more homogeneous loans more aggres- 
sively smoothing income during this period. 

While our hypothesis pertains to the coefficients on the variables in which Г, and 
HOM®, are interacted with Х,, we also include І, па , and HOM%, separately in Equation 
(1) to ensure that the coefficients on the interaction terms do not reflect any direct effects 
of profitability and loan portfolio composition on provisions for loan losses. Ahmed et al. 
(1999) include CAP, to control for banks recording provisions to raise regulatory capital 
and ANPA, to control for economic loan default during the period, and they find a negative 
coefficient on CAP, and a positive coefficient on ANPA,. 

PLL, X,, and ANPA, are deflated by loans outstanding at the beginning of year t. The 
other variables in Equation (1) are naturally deflated or scale-free. 

Table 2 reports the fixed time effects estimation of Equation (1). As predicted, the 
coefficient on X, is significantly positive (t — 1.8), consistent with banks smoothing their 
income using provisions for loan losses during the 1990s boom. As predicted, the coefficient 
on Х"Ривћтоалг 15 Significantly positive (t = 3.5), consistent with more profitable banks more 
aggressively smoothing income using provisions for loan losses than less profitable banks 
during the 1990s boom. The coefficient оп Т, is significantly positive (t = 2.9), also 


12 Equation (1) differs from the model in Ahmed et al. (1999) in the following ways. First, we do not include а 
difficult-to-collect control variable, the change in per capita dollar liabilities of failed businesses weighted by 
the geographic distribution of loans, which they found to be not significant. Second, we do not include the 
implied standard deviation of return on assets, which they use to proxy for the change in the risk of the loan 
portfolio, because this variable cannot be calculated for almost 20 percent of the sample due to insufficient 
returns data. We did conduct sensitivity analysis adding this variable for the subsample for which it is available, 
however, and the conclusions from our hypothesis tests are not altered. Third, we do not include certain inter- 
action terms involving X and CAP that relate to their time-period-specific hypotheses. Fourth, we include 1, 

by itself and interacted with X to test hypotheses regarding bank profitability. Fifth, we include HOM% by itself 
and interacted with X to test hypotheses regarding homogeneous loans. 

For simplicity, we use income before the pretax provision for loan losses throughout the paper. We tested the 
sensitivity of our results to using earnings before 0.6 times the provision for loan losses, and found no significant 
differences in the conclusions from our hypothesis tests (the coefficients on the provision for loan losses are 
algebraically affected, of course). 
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TABLE 2 
Regression of the Provision for Loan Losses on Earnings before the Provision for Loan Losses 
with Profitability and Loan Portfolio Composition Interactions*>*4 


PLL, = а + Ы, + CHOM%, + dX, + ЯХ, о) + 9(X,HOM%,) 


+ АСАР, + jANPA, + e. 5 (1) 
Hypotheses Coefficients 
у (er .008* 
(2.9) 
HOM% —.011 
(—1.6) 
X, + .124* 
(1.8) 
Хом + .718* 
(3.5) 
X HOM% + .573* 
(2.2) 
CAP, 4015 
(1.2) 
ANPA, —.039 
(—1.5) 
R2 27 
# obs 1739 


* White’s heteroscedasticity-adjusted t-statistic (shown in parentheses) is significant at the 5 percent level or 

better. 

* The sample is described in the notes to Table 1. 

5 All variables are defined in the notes to Table 1 except for Гы which takes a value of 1 if an observation 
has above median return on assets in year г, and 0 otherwise. 

* The equation is estimated with fixed time effects, which are not tabulated. 

8 The К? does not include the explanatory power of the fixed effects. 


possibly reflecting banks’ income smoothing. As predicted, the coefficient on X,*HOM%, 
is significantly positive (t = 2.2), consistent with the hypothesis that this income smoothing 
is stronger for banks with more homogeneous loans. 

In contrast to Ahmed et al.’s (1999) finding of significant coefficients on CAP, and 
ANPA,, we find none of the coefficients on the control variables CAP,, ANPA,, and HOM%, 
are significant. This lack of significance suggests that banks determined their provisions for 
loan losses primarily based on their profitability during the 1990s boom, perhaps because 
they were well capitalized and experienced low levels of loan default during that period. 

In summary, in this section we find that the provision for loan losses is positively 
correlated with income before the provision for loan losses, consistent with banks smooth- 
ing their income using provisions for loan losses during the 1990s boom. We find this 
income smoothing behavior is stronger for more profitable banks holding more homoge- 
neous loans. 


Acceleration of Loan Charge-Offs 


In this section, we test the hypothesis that more profitable banks accelerated charge- 
offs of homogeneous loans in order to keep their overstated allowances for loan losses from 
ballooning during the 1990s boom. We conduct these tests using the following model: 
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REC, = а + БНОМ9, | + сы, + dGLCO(HET), | 
+ ДСІСО(НЕТ), мые + 8 О РСО(НОМ),_; 


3 
+ h[GLCO(HOM),_.:*Tpighroa,,) + 7САР, + У KANPA,., + е. (2) 
s-0 


We infer that banks have charged off loans more quickly when their gross loan charge-offs 
have a more positive association with one-year-ahead recoveries, denoted КЕС,.! We dis- 
tinguish gross loan charge-offs for heterogeneous and homogeneous loans, denoted 
GLC(HET),., and GLCO(HOM ), ,, respectively, because we expect banks to have more 
ability to accelerate charge-offs of homogeneous loans than heterogeneous loans. Because 
we expect that more profitable banks accelerated charge-offs more than less profitable banks 
during the 1990s boom, we interact these gross loan charge-off variables with Z igaroar 

The coefficients on the gross loan charge-off variables in Equation (2) reflect the rates 
of recovery in year г of incremental charge-offs in year 1—1. Since some level of recoveries 
is to be expected even in the absence of discretion, the coefficients on both GLCO(HET),_, 
and GLCO(HOM),.., should be positive. The hypothesis stated above has two main impli- 
cations for these coefficients. First, the coefficient on GLCO(HOM ), ("Ч,дад-л Should be 
positive, consistent with more. profitable banks accelerating charge-offs for homogeneous 
loans more than less profitable banks. Second, the sum of the coefficients on 
GLCO(HOM ),., and GLCO(HOM , ,*1,,,:,5,-1 Should be larger than the sum of the coef- 
ficients on GLCO(HET), , and GLCO(HET), , "Tnighroar-v 16» 5 + В > d + f, consistent 
with more profitable banks accelerating charge-offs more for homogeneous loans than for 
heterogeneous loans. 

We include HOM%,_, and 1,,,,,,,-, separately in Equation (2) to ensure that the co- 
efficients on the interaction variables do not reflect any direct effects of loan portfolio 
composition and profitability on recoveries. We include CAP, because Moyer (1990), Beatty 
et al. (1995), and Collins et al. (1995) predict and generally find that loan charge-offs are 
used to manage regulatory capital. We include current and three lagged changes in non- 
performing assets, ANPA, ,, 0 = s = 3, to control for banks’ economic exposure to loan 
losses, because rionperforming assets is the best available measure of those losses. 

REC, GLCO(HET), ,, GLCO(HOM ),.,, and ANPA,., are deflated by loans outstanding 
at the beginning of year t—3. The other variables i in Equation (2) are naturally deflated or 
scale-free. 

Table 3 reports the fixed time effects estimation of Equation (2). As expected even in 
the absence of discretion, the! coefficients on GLCO(HET),_, and GLCO(HOM ),., are sig- 
nificantly positive (t = 7.1 and 11.4, respectively). These coefficients imply that gross 
charge-offs for heterogeneous loans are recovered 11.9 percent of the time, while 
gross charge-offs for homogeneous loans are recovered at the much higher rate of 23.8 
percent. 

As predicted, the coefficient on GLCO(HOM ), Я Tnighroar-1 15 significantly positive (t 
— 2.2), consistent with more profitable banks accelerating charge-offs of homogeneous 


14 We are able to demonstrate excessive loan charge-offs by healthy banks during the 1990s in various ways other 
than the association between gross loan charge-offs and future recoveries (с.р. by a rising ratio of net loan 
charge-offs to nonperforming assets). We chose recoveries as our dependent variable because of bank regulators’ 
and analysts’ use of recoveries to assess the aggressiveness of banks’ loan charge-off policies, as discussed in 
Section П. 

13 We do not expect loan charge-offs to be delayed to increase regulatory capital after the first “ш of years in 
our sample period, however, because virtually all banks are well capitalized by no later than 1994. 
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TABLE 3 
Regression of Recovery Rate on Heterogeneous and Homogeneous Loan Charge-Offs with 
Profitability Interactions*>*# 
REC, = а + БНОМФ, , + clu os, + EGLCO(HET),_; 
+ fIGLCO(HET), ,* lupus] + gGLCOGIOM),., 


3 
+ h[GLCO(HOM), - Чанов, 1 + JCAP, + DK ANPA,., + е. (2) 
Hypotheses Coefficients 
HOM% —.014 
(—.4) 
И. —.007 
(—.8) 
СІСО(НЕТ), .119* 
(7.1) 
GLCO(HET),_.:*Tpighroas-1 .046 
(1.4) 
СІСО(НОМ),_, 2385 
(11.4) 
СІСО(НОМ), ЧІ eghroat—1 + .083* 
(2.2) 
CAP, , .010* 
(3.4) 
ANPA, —1.642* 
(—2.8) 
ANPA,_, —2.610* 
(—4.1) 
ANPA, , —2.017 
| (-3.1) 
ANPA,_, —1.479 
(—2.1) 
g+h-d-f + .156 
(3.6) 
R? 47 
# obs 1709 


* White’s heteroscedasticity-adjusted t-statistic (shown in parentheses) is significant at the 5 percent level or 
better. 


* The sample is described in the notes to Table 1. 

^ All variables are defined in the notes to Tables 1 and 2. 

* The equation is estimated with fixed time effects, which are not tabulated. 
4 The R? does not include the explanatory power of the fixed effects. 


loans more than other banks. This result implies that above-median-profitability banks re- 
covered gross loans charge-offs of homogeneous loans at an 8.3 percent higher rate than 
below-median-profitability banks. Also as predicted, the difference between the sum of the 
coefficients on GLCO(HOM), ү and GLCO(HOM ), |" Гу... апа the sum of the coef- 
ficients on GLCO(HET),., and GLCO(HET), |" Tnighroas—1 18 Significantly positive (t = 3.6), 
consistent with more profitable banks accelerating charge-offs of homogeneous loans more 
than heterogeneous loans. 
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The coefficient on the control variable CAP,_, is significantly positive (t = 3.4), im- 
plying better capitalized banks recorded charge-offs more quickly than did poorly capital- 
ized banks.!$ The coefficients on ANPA,_,, 0 = s = 3, are all significantly negative, con- 
sistent with these variables capturing economic loan default. The coefficients on HOM%,_, 
and /,,„ьъаг-1 аге not statistically significant. 

In summary, the descriptive statistics in Table 1, Panel B discussed above show that 
banks experienced sharply increasing recovery rates during the early 1990s, consistent with 
banks charging off loans too quickly. Consistent with the descriptive analysis of recovery 
rates in Table 1, Panel C, in this section we find that, superimposed on this overall trend, 
recovery rates were higher for more profitable banks holding more homogeneous loans. 
These results are consistent with loan charge-offs being substantially determined by more 
profitable banks' discretionary behavior during the 1990s boom. 


Sustaining the Income Smoothing 

An alternative explanation for the high recovery rates for profitable banks charging off 
homogeneous loans is that these banks were consistently surprised by the persistently fa- 
vorable economic conditions during the extended 1990s boom, more so than other banks. 
In this section, we attempt to rule out this alternative explanation and demonstrate how 
these banks obscured their income smoothing over such an extended period. Specifically, 
we test the hypothesis that during the 1990s boom profitable banks experiencing recoveries 
of homogeneous loans recorded new gross loan charge-offs in order to reduce their allow- 
ances for loan losses. 

To test this hypothesis, we estimate the following model: 


GLCO, = а + ЬНОМФ, + cl, ios, + dREC(HET), + f(REC(HET),*Iyighroa] 
+ gREC(HOM), + hREC(HOM) ss] 
+ jCAP, + kPLL, + IALL,., + mANPA, + e, (3) 


The model captures the association of gross loan charge-offs with contemporaneous recov- 
eries of charge-offs of both heterogeneous and homogeneous loans, REC(HET), and 
REC(HOM ,, respectively, distinguishing above- and below-median-profitability banks. 

The coefficients on the recovery variables in Equation (3) reflect the rates of new gross 
loan charge-offs per incremental dollar of recoveries. The hypothesis stated above has two 
main implications. First, the coefficient on КЕС(НОМ), 1, „а, should be positive, consis- 
tent with more profitable banks recording more new charge-offs when they experience 
recoveries of homogeneous loans than do less profitable banks. Second, the sum of 
the coefficients оп REC(HOM), and КЕС(НОМ) Ч, даг Should be larger than the sum 
of the coefficients on REC(HET), and REC(HET),* I, oso 16» g + > d + Ё, consistent 
with more profitable banks recording more new gross loan charge-offs when they experience 
recoveries of homogeneous rather than heterogeneous loans. 

We include HOM%, in Equation (3) to control for loan portfolio composition. We 
include /,;,,,,, , separately to ensure that the coefficients on the interaction variables do not 
capture any direct effect of profitability. We include CAP, to control for the previously 


16 This result could be attributable to more poorly capitalized banks delaying charge-offs to increase regulatory 
capital, as the research discussed above generally finds happened during the bust period prior to 1990. Given 
that virtually all the banks in our sample are well capitalized, however, it is more likely that the better capitalized 
banks accelerated charge-cffs. 
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discussed finding of prior research that banks use loan charge-offs to manage regulatory 
capital. We include PLL, and ANPA, to control for new information about loan default during 
the year. We include ALL, , in the model to control for prior estimates of loan default. 

GLCO, КЕС(НЕТ), КЕС(НОМ), PLL, ALL, ,, and ANPA, are deflated bv loans out- 
standing at the beginning of year t. The other variables in Equation (3) are naturzlly deflated 
or scale-free. 

Table 4 reports the fixed time effects estimation of Equation (3). As predicted, the 
coefficient on REC(HOM),*1,,,,,,4, 18 Significantly positive (t = 1.9), consistent with more 
profitable banks recording more new charge-offs when they experience recoveries of ho- 
mogeneous loans than less profitable banks. Also as predicted, the difference tetween the 


TABLE 4 
Regression of Gross Loan Charge-Offs on Recoveries of Heterogeneous and Homogeneous 
Loans with Profitability Interactions*^^4 
GLCO, = а + БНОМФ, + c п, + dREC(HET), + fIREC(HET), * Тыа 
+ gREC(HOM), + h[REC(HOM), * Lusso] + 


+ jCAP, + kPLL, + lALL, + mANPA, + e, (3) 
Hypotheses Coefficients 
HOM% —.027 
(—1.5) 
1 .035* 
ere (2.5) 
REC(HET), 547 
(.8) 
КЕС(НЕТ) Ч, од 1.022 
(1.0) 
REC(HOM), 2.980* 
(5.5) 
REC(HOM) I, iron. + 1.468* 
(1.9) 
САР, —.099 
(—.8) 
PLL, 478" 
(4.0) 
ALL, , .059 
(1.6) 
ANPA, —.022 
(—.3) 
g+h-d-f + 2.879* 
(1.8) 
R? 97 
#obs 1729 


* White's heteroscedasticity-adjusted t-statistic (shown in parentheses) is significant at the 5 percent level or 
better. 

* The sample is described in the notes to Table 1. 

* All variables are defined in the notes to Tables 1 and 2. 

* The equation is estimated with fixed time effects, which are not tabulated. 

d The R? does not include the explanatory power of the fixed effects. 
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sum of the coefficients on REC(HOM), and REC(HOM ),*I,,,,,,, апі the sum of the coef- 
ficients on REC(HET), and REC(HET),*Tyighroa: 1$ Significantly positive (t = 1.8), consistent 
with more profitable banks recording more new gross loan charge-offs when they recover 
homogeneous loans than when they recover heterogeneous loans. 

The coefficient оп Га is significantly positive (t = 2.5), likely reflecting some of 
the hypothesized discretionary behavior. The coefficient on PLL, is significantly positive (t 
— 4.0), reflecting the well-known high positive correlation between charge-offs and pro- 
visions for loan losses. The coefficients on HOM%, САР, ALL, ,, and ANPA, are all not 
statistically significant. 

In summary, in this section we find that during the 1990s boom profitable banks that 
experienced more recoveries of charge-offs of homogeneous loans recorded more new gross 
loan charge-offs in the same year. These results are consistent with the hypothesis that 
banks recorded those loan charge-offs to reduce their allowances for loan losses and thereby 
obscure their downward income smoothing during that period. 


The Level and Persistence of Net Income 


Due to the prolonged nature of the 1990s boom, profitable banks smoothing their 
income downward had to do so over a fairly (and perhaps unexpectedly) long horizon. In 
this section, we test the hypothesis that profitable banks that charged off more homogeneous 
loans during this period had higher and more persistent income before the provision for 
loan losses over the next three years, consistent with them smoothing income over a pro- 
longed horizon. 

We test this hypothesis using the following model: 


X4, = а + bHOM%, + c NLCO(HET), + dNLCO(HOM), + fX, 
+ g(X, * HOM%,) + НІХ, * NLCO(HET)] + ЛХ, * NLCO(HOM),] 
+ КСАР, + IPLL, + тАМРА, + epp 1 8 5 53. (4) 


This model regresses income before the provision for loan losses for each of the next three 
years, X,,,, 1 = $ = 3, on net loan charge-offs for both homogeneous and heterogeneous 
loans, denoted NLCO(HET), and NLCO(HOM),, respectively, both separately and interacted 
with current income before the provision for loan losses, X,. 

Based on the hypothesis stated above, we expect positive coefficients on NLCO(HOM),, 
indicating that higher net charge-offs of homogeneous loans are associated with higher 
future income before the provision for loan losses. We also expect positive coefficients on 
X,*NLCO(HOM),, indicating that higher net charge-offs for homogeneous loans are asso- 
ciated with more persistent income before the provision for loan losses. 

We include X, in Equation (4) to capture the average level of income persistence for 
the sample. We include HOM%, and X,*HOM^*&, to control for the effect of loan portfolio 
composition on the level and persistence of income. We include CAP, to capture the pre- 
viously discussed finding of prior research that banks use loan charge-offs to manage reg- 
ulatory capital. We include PLL, and ANPA, to control for new information about loan 
default during the year. 

The variables X,,,, 0 = s = 3, NLCO(HET), NLCO(HOM), PLL, and ANPA, are 
deflated by loans outstanding at the beginning of year t. The other variables in Equation 
(4) are naturally deflated. 

Table 5 reports the fixed time effects estimation of Equation (4). As predicted, the 
coefficient on NLCO(HOM), is significantly positive for the two- and three-year horizons 
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TABLE 5 
Regression of Future Earnings before the Provision for Loan Losses on Current Net Loan 
Charge-Offs of Heterogeneous and Homogeneous Loans and Earnings before the Frovision for 
Loan Losses with Interactions between Charge-Offs and Earnings*^^4 


X,, = а + ЪНОМФ, + cNLCO(HET), + dNLCO(HOM), + fX,  g(X, * HOM%,) 
+ ВХ, * NLCO(HET), + ДХ," NLCO(HOM),] + kCAP, + IPLL, 


+ mANPA, + €4,1<s <3. (4) 
Coefficients 
Hypotheses МИР. СУ Хаз _ __ Хаз _ 
ном», —.001 —.001 -004 
(-1.0) (=) (-13) 
NLCO(HET), —.186 227 211 
(—1.1) (1.1) (.8) 
NLCO(HOM), + 142 7115 .663* 
(.9) (3.0) (2.4) 
X, .952* 1.087* 1.240* 
(29.2) (21.5) (15.9) 
X*HOM*6, —.491* —.607* —.591* 
(—3.0) (—3.2) (—2.0) 
X*NLCO(HET), —6.167 —31.892 —56.546* 
(—1.2) (—1.5) (—4.0) 
X;*NLCO(HOM), + 19.718* 45.234* 82.764* 
(2.7) (2.7) (3.5) 
CAP, —.003 —.009 —.018 
(—1.0) (—1.3) 1—1.5) 
PLL, 050 —.600* —.763* 
(.3) (—3.7) 1-3,0) 
ANPA, —.023 —.015 —.126 
(—.б) (—.4) 7-11) 
R? .69 :93 41 
# obs | 1680 1363 1115 


* White’s heteroscedasticity-adjusted t-statistic (shown in parentheses) is significant at the 5 percent level ог 
better. 


* The sample is described in the notes to Table 1. 

è All variables are defined in the notes to Tables 1 and 2. 

* The equation is estimated with fixed time effects, which are not tabulated. 
* The К?з do not include the explanatory power of the fixed effects. 


(t = 3.0 and 2.4, respectively), consistent with banks recording more charge-offs of ho- 
mogeneous loans having higher future income before provisions for loan losses than other 
banks, though this coefficient is positive but not significant for the one-year horizon. In 
contrast, the coefficient on NLCO(HET), is not statistically significant. Also as predicted, 
the coefficient оп X,*NLCO(HOM ), is significantly positive for each of the one-year to 
three-year horizons (t = 2.7, 2.7, and 3.5, respectively), consistent with banks recording 
more charge-offs of homogeneous loans having more persistent future income before pro- 
visions for loan losses than other banks. In contrast, the coefficient on X,*NLCO(HET), is 
always negative, and significantly so at the three-year horizon. 


The Accounting Review, March 2006 


Income Smoothing over the Business Cycle 439 


The coefficient on X, is significantly positive and generally in excess of 1, consistent 
with the sample banks’ income before the provision for loan losses having high persistence 
during the 1990s boom. The coefficients on PLL, and ANPA, are either not significant or 
significantly negative. The coefficient оп НОМ9 is not significant. 

In summary, in this section we find that, during the 1990s boom, banks recording more 
charge-offs of homogeneous loans had higher and more persistent future income before 
provisions for loan losses over the next three years than other banks. These results are 
consistent with these banks smoothing income over a prolonged horizon." 


V. CONCLUSION 

In this paper, we illustrate the contextual and asymmetric nature of income smoothing 
in the setting of banks' provisions for loan losses and loan charge-offs. Our evidence in- 
dicates that during the 1990s boom profitable banks smoothed their income downward by 
overstating provisions for loan losses for homogeneous loans, and they obscured this in- 
come smoothing by accelerating charge-offs of those loans and by recording more gross 
charge-offs when they inevitably recovered more charge-offs of these loans. Our results are 
in striking contrast with those of prior research that banks delayed provisions for loan 
losses primarily for heterogeneous loans during the pre-1990s bust period. Collectively, our 
results and those of prior research indicate that banks' exercise of discretion over provisions 
for loan losses and loan charge-offs is constrained by whether they hold homogeneous or 
heterogeneous loans, and that these constraints bind differently in the bust and boom phases 
of the business cycle. 

While we examine a specific context, contextual and asymmetric income smoothing is 
likely to occur beyond the banking industry and as a result of firm-specific or industry- 
level rather than macroeconomic phenomena. We believe that research on income smooth- 
ing would benefit in terms of economic interest, faithfulness to the rules and practices of 
accounting, and statistical power by seeking out and documenting discretionary behavior 
that manifests itself in more contextual and less purely symmetrical ways than is captured 
in the usual research designs. 

Our results also bear on the recent debate about rules- versus principles-based account- 
ing standards. While rules-based standards are often thought to yield less discretionary 
accounting, our results suggest that an accounting rule that appears particularly nondiscre- 
tionary—the charging off of homogeneous loans more than a given number of days past 
due—leads to more discretion in certain contexts. | 
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ABSTRACT: Despite. the classification of the cash flows on trading positions as op- 
erating under SFAS No. 102, trading Is economically a hybrid operating/non-operating 
activity. Reflecting this hybrid nature, we hypothesize and find that the change In net 
trading assets has a less positive association with returns and future CFO than do the 
pure operating components of cash flows and accruals, and it has a more positive 
association with returns and future CFO than do the pure non-operating components 
of cash flows. Our paper Is the first to propose and test hypotheses about the valuation 
implications of such hybrid cash flows and accruals. 
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I. INTRODUCTION 
n this paper we investigate whether the market prices the change in net trading assets 
[Г assets minus trading liabilities)! as an operating or non-operating activity or 
mixture of the two, and whether this market pricing is consistent with the association 
of the change in net trading assets with future cash flow from operations (CFO). Our 
investigation is motivated by the observation that—despite the classification of the cash 


! Trading assets include trading securities, derivative assets not designated as accounting hedges, and other trading 
assets (e.g., commodity contracts). Trading liabilities include derivative liabilities not designated as accounting 
hedges and other trading liabilities. Throughout our sample period 1991—2003, banks were required to account 
for trading assets and liabilities at fair value under the AICPA Audit and Accounting Guides for Banks and 
Savings Institutions and for Brokers: and Dealers in Securities or other accounting standards (e.g., SFAS No. 
115, FASB 1993). : 
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flows on trading positions as operating under SFAS No. 102—trading is economically a 
hybrid operating/non-operating activity. This is because the change in net trading assets 
equals the net unrealized gain, a primarily operating item, plus the net principal cash out- 
flow, a primarily non-operating item, on trading positions during the period. Our hypotheses 
and tests are based on the maintained assumption, supported by theory and empirical re- 
search, that operating activities have stronger implications for returns and future CFO than 
do non-operating activities. 

The operating aspects of trading pertain to the generation of trading revenue, which 
includes trading gains and losses as well as fees (e.g., commissions or spreads) for trading- 
related activities such as dealing. Accounting standards focus on these aspects of trading. 
For example, SFAS No. 115 (FASB 1993) defines trading as “active and frequent buying 
and selling ... with the objective of generating profits [i.e., gains] on short-term differences 
in price." SFAS No. 119 (FASB 1994) includes dealing under its definition of trading. The 
non-operating aspects of trading pertain to the principal cash flows at the inception of 
investments in trading assets or fundraising through trading liabilities and to the subsequent 
return or repayment of that principal. In practice, however, the operating and non-operating 
aspects of trading are inseparable, because the generation of trading revenue often requires 
traders to assume trading positions with initial principal (i.e., trading assets and liabilities 
with non-zero principal amounts at inception). 

Based on the accounting literature, financial economic theories, and empirical findings 
summarized in Sections II and III, we expect that share returns and future CFO are posi- 
tively associated with current CFO and accruals, because the operating activities that give 
rise to CFO and accruals tend to be positive present value and persistent. In contrast, we 
expect that returns and future CFO have a near zero association with cash flows from 
investing (CFI) and financing (CFF), because the non-operating activities that give rise to 
CFI and CFF tend to be close to zero net present value. Reflecting the hybrid nature of 
trading, we hypothesize and find that the change in net trading assets has a less positive 
association with returns and future CFO than do the pure operating components of CFO 
and accruals and a more positive association with returns and CFO than do CFI and CFF. 
Our paper is the first to propose and test hypotheses about the valuation implications of 
such hybrid cash flows and accruals. 

While our research question applies to all firms engaging in trading, we examine a 
sample of 37 U.S. banks that hold appreciable amounts of trading assets. We chose this 
sample because it is the largest available set of fairly homogeneous, publicly traded firms 
with appreciable trading positions. 

We conduct three additional empirical analyses to investigate further the valuation im- 
plications of hybrid operating/non-operating flows. First, as a benchmark for our results 
regarding trading positions, we examine the associations between both returns and future 
CFO and the change in held-for-sale (HFS) loans. HFS loans constitute the closest and 
economically most important analogue to trading positions for our banking sample. Like 
trading, originating HFS loans is a hybrid operating and non-operating activity. We expect 
the non-operating aspects of HFS loans to be relatively more important than for trading, 
however, because most HFS loans are residential mortgages or consumer loans that are 
originated and sold in highly competitive markets, rendering this activity closer to zero 
present value. We hypothesize and find that returns and future CFO are less positively 
associated with the change in HFS loans than with the change in net trading assets. 

Second, in our primary empirical analysis described above, we examine the valuation 
implications of the total change in net trading assets, not its primarily operating (net un- 
realized gain) and non-operating (principal cash flow) components. We do this in part 
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because we cannot separately observe unrealized gains and the principal cash flows on 
trading positions and in part because of the inseparability of the operating and non-operating 
aspects of trading. We are able to observe trading revenue for a subset of observations, 
however, and for these observations we use this variable as a proxy for net unrealized gains 
on net trading assets during the period. We hypothesize and find that returns and future 
CFO are positively associated with trading revenue controlling for the change in net trading 
assets. 

Third, to address potential heterogeneity in our sample and to examine whether the 
market differentiates banks based on the relative operating nature of their activities, we 
decompose the sample into six large derivatives dealers with both trading assets and lia- 
bilities and 31 non-dealer banks with only trading assets. Due to the dealers’ greater focus 
on fee generation, we expect their activities to be more operating than those of non-dealer 
banks. We hypothesize and find that returns and future CFO are more positively associated 
with the pure operating components of CFO and accruals and with the change in HFS loans 
for dealers than for non-dealer banks. Unexpectedly, we find that the change in net trading 
assets exhibits no differential market pricing and only a marginally significantly different 
association with future CFO for dealers and non-dealer banks. 

Our results generally support the conclusions that the market appreciates the hybrid 
nature of the cash flows and accruals associated with trading positions and HFS loans and 
that it differentiates banks based on the relative operating character of their activities. This 
likely reflects the fact that these items are disclosed separately from other cash flows and 
accruals in banks’ financial reports, and so investors can adjust their valuation analyses for 
these items. Our results imply that it is important that the cash flows and accruals for hybrid 
activities be reported in a disaggregated fashion on the statement of cash flows (SCF) and 
elsewhere in financial reports, regardless of how these items are classified. 

Our results bear on two related, important, and timely financial reporting issues. First, 
the proper classification of cash flows on trading positions is of current interest due to 
recent highly publicized cases in which Enron and other trading firms issued liabilities that 
provided them with cash at inception—prepaid derivatives or commodity contracts (pre- - 
pays)—that they classified as trading and for which they classified the initial principal cash 
flow in CFO. In a prepay, one firm receives cash at the inception of the contract in exchange 
for the promise of future delivery of cash, commodities, or other assets to its counterparty. 
As such, prepays involve financing. 

‚ Addressing the cash flow classification issues raised by prepays, SFAS No. 149 (FASB 
2003) requires that if a derivative contains “ап other-than-insignificant financing element 
... at inception, then the borrower shall report all cash inflows and outflows associated with 
that derivative" as financing. It seems likely that SFAS No. 149's classification requirement 
will be applied by analogy to all trading liabilities, prepays in particular, regardless of 
whether they are deemed derivatives for accounting purposes. This classification is in con- 
trast to the operating classification of the principal cash flows on trading liabilities during 
our sample period. Our analysis and results imply that the classification of the hybrid cash - 
flows on trading positions as either operating or non-operating is limited. We discuss pos- 
sible remedies for this problem in the conclusion. 

Second, the distinctions between operating and non-operating activities and thus be- 
tween operating and non-operating cash flows made in SFAS No. 95 (FASB 1987) are 
inherently blurry for the large and increasingly economically important set of trading firms 
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and financial institutions whose operations primarily involve investing and financing.” For 
such firms, operating and non-operating activities are better viewed as ends of a continuum, 
with different types of activities falling at different places along the continuum. In this 
study, we focus on the market’s perception of where trading, which SFAS No. 102 views 
as an operating activity, falls along this continuum. We adopt this focus because in our 
view trading constitutes the most salient example of the phenomena we wish to describe 
and explain.? 

The remainder of the paper is organized as follows. Section II summarizes the related 
research literatures and our contributions to them. Section III discusses GAAP and ac- 
counting practices regarding SCF classification for trading positions and HFS loans during 
our sample period, and it motivates and states our hypotheses. Section IV describes our 
sample, defines our cash flow and accrual components, and discusses the descriptive sta- 
tistics. Section V develops our returns and future CFO regression equations, restates our 
hypotheses as coefficient restrictions in those equations, and reports the estimation results. 
Section VI reports the results of various specification analyses. Section VII concludes. 


II. PRIOR RESEARCH 

Our paper contributes to four distinct literatures in accounting research: (1) on the 
market pricing of (the components of) cash flows and either earnings or accruals for non- 
financial firms, (2) on the market pricing of the components of earnings for financial insti- 
tutions, (3) on the distinct nature of trading firms and the usefulness of the financial infor- 
mation they provide, and (4) on the importance of the financial statement classification of 
hybrid items. We discuss each of these literatures and our contributions to them in turn. 

The first literature—examining the market pricing of (the components of) cash flow 
and either earnings or accruals for nonfinancial firms—is large and well established.* Of 
this literature, Livnat and Zarowin (1990) is the paper most closely related to ours, being 
the only one to examine the valuation implications of the components of cash flow. For a 
sample of nonfinancial firms, Livnat and Zarowin (1990) predict based on financial eco- 
nomic theories and find that the market prices CFO, CFI, CFF, and accruals differently. 
Specifically, they find that returns are significantly positively associated with both CFO and 
accruals, less strongly and significantly negatively associated with CFI, and insignificantly 
positively associated with CFF. Consistent with Livnat and Zarowin's (1990) finding that 
returns are positively associated with CFO and accruals, for samples of nonfinancial firms, 
Dechow et al. (1998) and Barth et al. (2001) find that CFO and the components of accruals 
are significantly positively associated with future CFO. We contribute to this literature by 
developing and testing hypotheses about the implications for returns and future CFO of the 
hybrid trading and HFS loan components of cash flows and accruals. We expect these 
components to fall at different places along the continuum between the primarily operating 
and non-operating items examined by the prior literature. 

The second literature—examining the valuation implications of components of earnings 
for financial institutions—is also large and well established. For example, for samples of 


? See Ryan (2002) for elaboration of this point in the contexts of various types of financial institutions. 

3 Our study may also have implications for the analogous SCF classification issues that arise for nonfinancial 
firms as well. For example, many academics and practitioners (e.g., Nurnberg 1993) have criticized nonfinancial 
firms' classification of the following cash flows as operating under SFAS No. 95 and SFAS No. 102 (FASB 
1987, 1989): interest receipts and expenditures, dividend receipts, expenditures for inventory to stock newly 
opened stores, and installment sales receipts. Of these cases, installment sales, which involve both operations 
and financing, exhibit a nature closest to the hybrid nature of trading. 

^ See, for example, Wilson (1986, 1987), Bowen et al. (1987), Rayburn (1986), Bernard and Stober (1989), Ali 
(1994), and Dechow (1994). 
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property-casualty insurers, Foster (1977) examines the market pricing of the underwriting, 
investment, and capital gains components of earnings, and Beaver and McNichols (2001) 
examine the market pricing of the CFO, accrual, and loss reserve development components 
of earnings. For samples of banks, Barth et al. (1990) examine the market pricing of 
earnings before security gains and losses and realized security gains and losses, Barth 
(1994) examines the market pricing of these variables and also unrealized security gains 
and losses, and Wahlen (1994) examines the market pricing and future CFO implications 
of earnings before the provision for loan losses, the provision for loan losses, and various 
other loan default variables. While some of these prior studies examine gains and losses 
and find them to be value-relevant, they do not focus on trading gains and losses, as we 
do. We contribute to this literature by being the first study to examine the market pricing 
of components of cash flows for any type of financial institution. The distinctions between 
operating and non-operating activities and thus between operating and non-operating cash 
flows are inherently blurry for financial institutions, because their operations involve in- 
vesting and financing. 

The third literature—examining the distinct nature of trading firms and the usefulness 
of the financial information they provide—is both small and recent. Jorion (2002) and Liu 
et al. (2004) examine the risk-relevance of banks’ Value-at-Risk disclosures for their trading 
portfolios. These papers relate to our study insofar as they are affected by some of the 
same economic and financial reporting phenomena that motivate our analysis, for example, 
banks’ matching of trading assets and liabilities. We contribute to this literature by providing 
new insights into the distinct nature of trading firms, in particular, how their trading posi- 
tions blur the distinctions between operating and non-operating activities. 

The fourth literature—examining the importance of financial statement classification of 
hybrid items—is also fairly small and recent. In an experimental study, Hopkins (1996) 
shows that buy-side financial analysts’ assessment of a specific financing instrument with 
a hybrid nature—mandatorily redeemable preferred stock—is affected by how that instru- 
ment is classified on the balance sheet. Hopkins' (1996) study is motivated by psychology/ 
judgment and decision-making research regarding how knowledgeable individuals organize 
and interpret information. Cheng et al. (2000) and Linsmeier et al. (2000) empirically 
examine the association between market value and various hybrid financing instruments 
that may be classified as debt, equity, or in the mezzanine on the balance sheet. Our paper 
contributes to this empirical literature by being the first market-based study of the relation 
between the market pricing and SCF classification of hybrid items. 


Ш. RELEVANT GAAP AND HYPOTHESIS DEVELOPMENT 

Relevant GAAP 

GAAP governing the classification of cash flows acknowledge that certain activities 
have both operating and non-operating aspects. SFAS No. 95 provides intuitive and well- 
known characterizations of investing and financing activities that we do not repeat here. 
Rather than defining operating activities directly, SFAS No. 95 (paragraph 21) defines op- 
erating activities as a catchall category that includes “‘all transactions and other events that 
are not defined as investing or financing activities." This suggests that hybrid transactions 
that do not cleanly fall into the investing or financing categories should be classified as 


5 John Dickhaut points out that classification can affect market prices even under models with fully rational 
investors depending on the specific process by which prices form, and so psychology theories are not the only 
possible reason why classification might matter. We believe his point is worthy of future research by accounting 
theorists, who up to this point have not focused on the effects of financial statement classification. 
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operating. However, in SFAS No. 95 (paragraph 24), FASB states that "[c]ertain cash 
receipts and payments may have aspects of more than one class of cash flows ... [i]f so, 
the appropriate classification shall depend on the activity that is to be the predominant 
source of cash flows for the item." 

In SFAS No. 102, FASB (1989) decided that the predominant characteristics of trading 
positions and HFS loans are operating, requiring in SFAS No. 102 (paragraphs 8 and 9) 
that the cash flows on assets “‘acquired specifically for resale and ... carried at market value 
in a trading account" and loans “originated or purchased specifically for resale and ... held 
for short periods of time" be classified as operating. In SFAS No. 102 (paragraph 26), 
FASB justified this requirement stating that these assets are "'similar to inventory in other 
businesses." No GAAP rule developed prior to the issuance of SFAS No. 149 in April 2003 
specifically considers the classification of the cash flows from trading liabilities with initial 
principal or any other type of liability that has both operating and financing aspects. Prior 
to SFAS No. 149, the practice for our sample of banks (and, in our understanding, for 
trading firms generally) was to classify the cash flows on trading liabilities as operating, 
consistent with the classification of the cash flows on trading assets. As discussed in the 
introduction, this practice likely will change due to the issuance of SFAS No. 149. 


Hypothesis Development 

We expect the associations of returns and future CFO with the cash flows and accruals 
resulting from an activity to rise with the extent to which that activity is positive present 
value (or is associated with other positive present value activities), and, if the activity is 
positive present value, with its persistence. We expect operating activities to be more pos- 
itive present value and possibly more persistent than are non-operating activities for the 
following reasons. First, capita] markets are relatively efficient compared to operating mar- 
kets and non-market settings. Second, numerous financial economic theories surveyed by 
Livnat and Zarowin (1990) imply that managerial inside information and other market 
imperfections can cause investing and financing activities to be either positive or negative 
present value, with these effects possibly offsetting. Third, an August 1999 G4+1 discus- 
sion paper on reporting financial performance concludes that operating activities tend to be 
*value adding" and "recurring." Fourth, Livnat and Zarowin (1990) find that returns are 
significantly positively associated with CFO and accruals, are less strongly and significantly 
negatively associated with CFI, and are insignificantly positively associated with CFF. Fi- 
nally, Dechow et al. (1998) and Barth et al. (2001) find that future CFO is significantly 
positively associated with CFO and the components of accruals. 

For banks, operating activities often generate related ongoing streams of fee income, 
may involve skill or other comparative advantage that yields positive net present value, and 
tend to be highly repetitive. Operating activities are especially likely to be positive present 
value for large banks that engage in investment banking and other high-margin services 
businesses. In contrast, banks’ non-operating activities are less likely to be associated with 
ongoing streams of fee income and are more likely to be close to zero net present value, 
although they often are highly persistent. In particular, we expect the value implications of 


$ In paragraphs 93—95 of SFAS No. 95, FASB provides the example of installment sales—which have both 
operating and investing characteristics for the seller—to illustrate how the predominant characteristics should 
be determined. The FASB concludes that the predominant characteristic of installment sales is operating, stating 
“cumulative cash flow from operating activities over the life of an enterprise that finances most of its sales 
under installment plans might be negative." In reaching this conclusion, FASB appears to be primarily concerned 
with the usefulness of CFO as a performance measure. 
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investing and financing activities predicted by the financial economic theories mentioned 
above to be relatively small (though not necessarily zero) for banks, due to the highly 
competitive nature and frequency of those activities. For these reasons, we expect banks’ 
returns and future CFO to be more positively associated with their CFO and accruals than 
with their non-operating cash flows. 

Trading has characteristics of an operating activity insofar as it requires managerial 
effort and skill to identify mispriced assets and liabilities and to manage complex and 
interrelated positions and this effort and skill generates positive net present value through 
trading gains or fee income for services such as dealing, financial advice, and securities 
custody and processing. Though typically held for a shorter time than non-trading assets 
and liabilities, trading assets (liabilities) constitute investments (provide financing) for as 
long as they are held. Thus, trading has both operating and non-operating aspects. These 
aspects are inseparable, however, because the generation of trading revenue often requires 
traders to take positions with initial principal. For example, a derivatives dealer that writes 
a call option for a client receives a cash premium at inception; the portion of that premium 
that exceeds the fair value of the option is trading revenue and the remainder of the premium 
is the initial principal amount of a trading liability.’ 

Like trading positions, HFS loans have characteristics of an operating activity insofar 
as originating these loans requires managerial effort and skill to solicit and screen borrowers 
and this effort and skill generate positive net present value or are associated with services 
that yield fee income (e.g., loan origination and servicing). Though typically held for a 
shorter time than loans held in portfolio, HFS loans constitute investments for as long as 
they are held. 

Trading positions have relatively more of an operating nature than HFS loans, however, 
for two reasons. First, the management of trading positions both individually and at the 
portfolio level can be complex, and these positions are more likely to have positive net 
present value and to be associated with high-margin services. In contrast, most HFS loans 
are residential mortgages or consumer loans, commodity financial products that are origi- 
nated, serviced, and sold in highly competitive markets. Second, many types of trading 
firms (e.g., dealers and arbitrageurs) manage their trading positions at the portfolio level, 
assuming largely matched positions in trading assets and liabilities, where their trading 
liabilities both finance and hedge their trading assets. Well-matched trading portfolios re- 
quire little net investment or provide little net financing, thereby accentuating the operating 
aspects of those portfolios. 

In summary, trading positions and HFS loans are mixtures of operating and non- 
operating activities, with the mix being tilted relatively more toward operating for trading 
positions than for HFS loans. Under the maintained assumption that investors have some 
(though not necessarily a perfect) understanding of these classification issues, we hypoth- 
esize (in alternative form) that the associations of both returns and future CFO with changes 
in net trading assets and HFS loans reflect these mixtures as follows: 


Н: Returns and future CFO have: 
e relatively high positive associations with the portion of operating cash flows and 
accruals arising from activities other than trading and HFS loans, 


7 While GAAP have changed somewhat over time, dealers generally are allowed to recognize trading revenue 
(dealer profit) at the inception of trading positions if the market provides sufficient evidence of fair value (see 
AICPA Audit and Accounting Guide for Brokers and Dealers in Securities, EITF 00-17, and EITF 02-3). 
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* a lower positive association with the change in net trading assets, 
® an even lower positive association with the change in HFS loans, and 
* relatively small positive or negative associations with CFI and CFF. 


This compound hypothesis contains four separate hypotheses about the levels of the asso- 
ciations of returns and future cash flows with specific components of cash flows and ac- 
cruals. In Sections V and VI we restate Hypothesis H as restrictions on differences between 
coefficients in our regression equations. | 


IV. SAMPLE, DEFINITIONS, AND DESCRIPTIVE STATISTICS 
Sample 

To be included in our sample, a firm must be a U.S. commercial bank or thrift (hereafter 
bank) appearing on the 2002 or 2003 Annual Bank Compustat file that has at least $500 
million in total assets and trading assets equal to at least 1 percent of total assets in at least 
one year in our sample period. Since banks’ regulatory capital leverage ratios? average 8 
to 9 percent during our sample period, the sample banks’ trading assets typically exceed 
that percentage of their owners’ equity. We imposed the conditions on total and trading 
assets to ensure that the banks had appreciable trading operations. We did not require these 
conditions to be met in each year for a given bank in order to preserve time-series data 
and thus statistical power. We did not include Citigroup after the 1998 acquisition of 
Citicorp by Travelers in the sample due to the concern that banks and insurance compan- 
ies’ cash flows and accruals would have different natures and thus different valuation 
implications. 

To the extent possible, we obtained financial report data from Bank Compustat. Since 
Bank Compustat does not include SCF data, we manually collected the following variables 
from the SCFs for each bank for every year the variables were available online on the 
SEC’s EDGAR database: CFO, CFI, CFF, the change in net trading assets, the change in 
HFS loans, and changes in all other accrual accounts during the year. Because the EDGAR 
database begins with 1993 filings by a subset of firms and SCFs include three years of 
data, our SCF data begins in 1991 with the seven banks that filed on EDGAR іп 1993.3 
For 27 (8 percent) of our observations, banks did not provide separate line items for the 
change in net trading assets on their SCF, and in these cases we constructed these line 
items using the changes in these balance sheet amounts if the banks provided this infor- 
mation in two consecutive years. None of our results change substantively if we drop these 
observations. 

We obtained annual returns from May of the fiscal year to April following the fiscal 
year, adjusted for stock splits and dividends, denoted R, from the 2003 CRSP Monthly file. 
Combining the data from Bank Compustat, EDGAR, and CRSP yields 330 observations | 
from 1991—2003 with the complete data needed for the main returns regressions analyses. 
Table 1, Panel A reports the effect of each of the above conditions on the sample size. 

The banks in our sample are individually large, and they hold a high and increasing 
percentage of the assets of the U.S. banking system. For example, these banks held 31 


* Banks’ leverage ratio for regulatory capital purposes is Tier 1 capital divided by regulatory assets, which roughly 
corresponds to owners’ equity divided by assets. 

* For certain observations, the data taken from the 1993 filings on EDGAR for the 1991 and 1992 fiscal years 
may be restated for mergers accounted for as poolings of interest. In Section VI, we discuss the results of a 
specification test in which observations that appear to have had significant mergers or acquisitions or significantly 
restated earnings in a year are eliminated. This specification test yields results substantively similar to those 
reported in this paper. 
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TABLE 1 
Sample 


Panel A: Sample Construction 


U.S. banks on 2002 or 2003 Bank Compustat 
~ U.S. banks with less than $500 million assets 


= U.S. banks that with more than $500 million assets 
— U.S. banks with trading assets less than 1 percent of total assets 


- Sample banks 
Panel B: Sample Banks* ^ 


Assets Trading% CFO АСС ATA AHFS 


451 


Derivatives dealers* 


Bank of America 392.6 :068 157 -.013 073 .022 
Bank of New York 624 .055 102 —.010  .008  .096 
Citicorp 261.3 .105 097  .022 2417 .011 
Fleetboston Financial 136.0 017 192 -.089 .015 .028 
JPMorgan Chase & Со 408.9 .186 142 020 178  .000 
Wachovia Corp 191.7  .043 .186 —.063  .038 .032 
Non-dealer banks 
Bank One Corp 210.2 .030 .204 —.112 -.037 .000 
Bankatlantic Bancorp? 44 .012 549 —.458 .068 —.281 
Beverly Hills Bancorp* 1.4 012 —1.814 1.832 —.006 1.920 
Chester Vy Bancorp 0.6 .000 007 010 .005: .000 
City National Corp 6.6 .009 147 —.100 -.009  .000 
Commerce Bancorp 8.2 011 140 -.051  .022 —.025 
Community West Bancshares 0.3 .020 —.129 .168 —.051  .443 
First Defiance Financial? 1.1 010 | —1.027  .859 —.046  .529 
First Horizon National Corp 8.8 .017 —.056 206 .052 270 
First Tennessee National 16.7 .019 —.005 .090 .017  .073 
Irwin Financial Corp 26  .027 —.386 454 .040  .485 
M & T Bank Corp 25.0 .006 115 -.027  .002  .029 
Massbank Corp 0.9 .047 058 029 .019 .005 
Mellon Financial Corp 40.7 006 148 —.044  .000  .015 
National Commerce Financial 11.0 .008 073 —.009 .008 .001 
National Penn Bancshares 2.0 006 072 —.009  .008 —.002 
Ocwen Financial Corp? 2.3 056 095 —.131 -.188 .006 
Oriental Financial Group 1.9 017 018 .093 .001 .077 
Popular Inc 209 .009 202 —.085  .012 .030 
В & С Financial Corp 42 07 -.363 .607 .289  .625 
Regions Financial Corp 27.6 .005 114 —.013 .002  .018 
Republic Bancshares Inc 20 .010 077 —.054  .074 .000 
State Street Corp 452 012 079 —.011 010 .000 
Sterling Bancshares 29 034 035  .034 .085 —.032 
Suntrust Banks Inc 582 .003 101 —.017 -.004  .015 
Union Planters Corp 26.8 .010 219 —.083  .014  .000 
Unionbancal Corp 33.2 .009 .162 —.037 .014 -.001 
Webster Financial Corp? 8.1 .007 201 —041 .029  .043 
Wells Fargo & Co 1672 .010 080  .016 .009  .097 
Woronoco Вапсогр“ 0.8 012 218 -141 —.121  .007 
Zions Bancorporation 13.6 .016 081  .015 .014  .025 


(continued on next page) 
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* Trading% denotes the ratio of trading assets to total assets; CFO denotes cash flow from operations; ACC 
denotes total accruals, i.e., the difference between net income and CFO; АТА denotes the change in net trading 
assets; AHFS denotes the change in held-for-sale loans; CFI denotes cash flow from investing; CFF denotes 
cash flow from financing. Total Assets are in billions. All variables other than Total Assets and Trading% are 
deflated by beginning-of-year market value of equity. 

^ AHFS is assumed to be zero for bank-years it is not reported on the Statement of Cash Flows. 

* Indicates derivatives dealers that report changes in both trading assets and trading liabilities in their Statement 
of Cash Flows. 

4 Indicates thrifts. 


percent (25 percent) of the assets of U.S. commercial banks (commercial banks and thrifts) 
in 1995 and 45 percent (38 percent) of the assets of U.S. commercial banks (commercial 
banks and thrifts) in 2003.0 Mergers and acquisitions are the primary cause of the rise in 
these percentages over time. Ignoring our omission of Citigroup after its acquisition by 
Travelers in 1998, our sample banks held almost all of Фе trading assets and trading 
liabilities of the U.S. banking system throughout our sample period. 

Our sample contains two distinct subsamples. Six of the largest commercial banks— 
Bank of America, Bank of New York, Citicorp, FleetBoston, JPMorgan Chase, and 
Wachovia—collectively dominate the global over-the-counter derivatives-dealing markets. 
Derivatives dealers closely match the value and risk characteristics of their derivatives assets 
and liabilities. As a consequence of this matching, the six dealers have substantial trading 
liabilities that average about half the amount of their trading assets on average, with the 
excess of their trading assets over their trading liabilities reflecting their classification of a 
significant portion of their investment securities as trading. The remaining 31 non-dealer 
banks have minimal or no trading liabilities, and their trading assets are primarily securities. 
To determine the nature and extent of the heterogeneity across the two subsamples, in 
addition to testing our hypotheses on the full sample, we test for differences across the two 
subsamples. 

Table 1, Panel B lists the 37 banks (31 commercial banks and six thrifts) in the sample, 
distinguishing the dealers from the non-dealer banks. To provide a sense for the hetero- 
geneity of the sample banks, the panel reports the mean values of total assets, trading assets 
as a percentage of total assets, and certain key regression variables defined below for each 
bank across the years it appears in the sample. 


Definitions of the Cash Flow and Accrual Components 

The banks in our sample all report CFO using the indirect method. This method begins 
with net income, denoted NJ, and then subtracts certain items that for simplicity (but not 
entirely accurately, as discussed below) we refer to as total accruals, denoted ACC, in order 
to arrive at CFO. We decompose ACC into the change in net trading assets (trading assets 
minus trading liabilities), denoted ATA, the change in HFS loans, denoted АНЕ, and all 
other accruals, which we refer to as ordinary accruals, denoted OACC. This breakdown 
reflects the following representation of the operating section of the SCF: 


10 Assets for the commercial banking industry are obtained from Carlson and. Perli (2004) and assets for the thrift 
industry are obtained from Office of Thrift Supervision (2004). 
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NI — ACC = NI - OACC - ATA - AHFS = CFO. (1) 


Of the three components of ACC, only OACC reflects pure accruals, i.e., timing dif- 
ferences between NI and CFO. While ATA and AHFS include unrealized gains and losses, 
which are timing differences, they also include principal cash flows that never affect NI but 
do affect CFO. For example, ATA equals a timing difference—the unrealized gain on trad- 
ing positions during the period—plus a net principal cash outflow—the purchases of net 
trading assets during the period minus the principal payments on net trading assets during 
the period minus the cost of net trading assets sold during the period. If we ignore the 
possibility of default by the firm or its counterparty, then the principal cash flows on ATA 
and AHFS sum to zero over the lives of those items. 

Reworking Equation (1) slightly, we define ordinary cash flow from operations, OCFO, 
as CFO plus ATA and AHFS, which equals МІ minus OACC: 


OCFO = CFO + ATA + AHFS = NI — OACC. (2) 


Just as OACC reflects pure accruals, OCFO reflects pure operating cash flows. 
The definitions in Equations (1) and (2) correspond to the following analytical rework- 
ing of the operating section of the SCF: 


NI 
— OACC 


= OCFO 
— ATA 
— AHFS 
= CFO 


In the rest of this paper we refer to OCFO and OACC as the pure operating components 
of cash flow and accruals, respectively. We refer to ATA and AHFS as hybrid operating/ 
non-operating components of cash flow and accruals, where as discussed above ATA is 
relatively more operating than AHFS. We refer to CFI and CFF as the pure non-operating 
components of cash flow. 

To illustrate these definitions, the Appendix reports M&T Bank Corporation’s 2000 
SCF with the operating section of that statement reworked in the fashion described above. 
For this observation, both ATA ($6.9 million) and AHFS ($81.5 million) are positive, with 
AHFS being quite large, so that OCFO ($435.1 million) is 26 percent more positive than 
reported CFO ($346.6 million), and OACC (—$148.9 million) is 146 percent more negative 
than ACC (—$60.5 million). As indicated by the descriptive statistics for these variables 
discussed below, large differences between the pure operating and reported measures of 
CFO and accruals are common in our sample. The investing and financing sections of this 
SCF provide representative examples of non-operating cash flows for the banks in the 
sample that indicate the arbitrary nature of SCF classifications for banks and other financial 
institutions. For example, M&T Bank's cash flows from investing activities include the 
principal cash flows on (available-for-sale and held-to-maturity) investment securities that 
would have been classified in cash flows from operating activities had M&T Bank classified 
those securities as trading. 
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As discussed above, ATA equals unrealized gains (more operating) plus principal cash 
outflows for trading positions (less operating) during the period. Neither of these compo- 
nents of ATA is directly observable. In specification analysis reported in Section VI, we 
use the best available proxy for one of the components of ATA to distinguish the compo- 
nents and thereby analyze their distinct valuation consequences. Specifically, for the ob- 
servations for which trading revenue, denoted TR, is reported, we use TR as a proxy for 
the unrealized gain component, so that ATA — TR proxies for the principal cash outflow 
component. ТК includes unrealized and realized gains and may include some fee revenue 
from trading. As a result, TR overstates (understates) unrealized gains by the amount of 
realized gains (losses), and it overstates unrealized gains by the amount of fee revenue it 
includes. Since TR measures unrealized gains with error, ATA — TR measures the principal 
cash outflows with an error of the same magnitude but opposite sign. The amount of this 
` error cannot be determined. 


Descriptive Statistics 

Table 2 reports descriptive statistics for the regression variables, total assets, and the 
ratio of trading assets to total assets. Panel A reports these statistics for the full sample, 
while Panels B and C report them for the subsamples of six dealers and 31 non-dealer 
banks, respectively. All of the regression variables other than & (which is naturally deflated) 
are deflated by the market value of equity at the beginning of the fiscal vear. To mitigate 
the effect of outliers in the regression analyses, all of the regression variables are winsorized 
at three standard deviations from their means and descriptive statistics are reported on the 
winsorized variables. 

Panel A of Table 2 shows that, like the rest of the banking industry, the sample banks 
generally performed well from 1991—2003, as evidenced by positive mean R (.248), CFO 
(.080), and ACC (.016). Moreover, the mean of OCFO (.177) is more than twice as large 
. as the mean of CFO (.080), because the sample banks grew their net trading assets and 
HFS loans over the sample period (mean ATA = .028 and mean AHFS = .069), depressing 
CFO substantially. The standard deviations of ATA and AHFS, .220 and .339, are each 
substantial relative to the standard deviations of CFO and ACC, .438 and .413, suggesting 
that CFO and ACC are heterogeneously noisy measures of the pure operating components 
OCFO and OACC, respectively. Consistent with their good performance, the sample banks 
raised substantial financing (mean CFF = .443) and made significant investments (mean 
CFI = —517). 

Comparison of Panels B and C of Table 2 shows that, as expected, the six dealers have 
much higher mean total assets ($264 billion versus $32 billion), mean ratio of trading assets 
to total assets (.090 versus .017), and mean ТА (.082 versus .023) than the 31 non-dealer 
banks. The dealers have much higher mean CFO (.149 versus .060) and OCFO (.265 versus 
.152) than the non-dealer banks, suggesting better performance, although mean R is similar 
for the two subsamples. The dealers grew their trading assets faster (mean ATA = .083 
versus .013) but their HFS loans slower (mean AHFS = .033 versus .079) than the non- 
dealer banks. 

Table 3 reports both Pearson and Spearman correlations for the variables in our returns 
regressions, many of which are significantly correlated. Most of the Pearson and Spearman 
correlations are similar, although we discuss one notable difference below. We summarize 
only the more interesting of these correlations and the underlying phenomena they suggest. 
R is more highly positively correlated with OCFO (Pearson = .176; Spearman = .262) 
than with CFO (Pearson = .059; Spearman = .070) and with ACC (Pearson .008; 


uod 
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TABLE 2 

Descriptive Statistics* Р 
Panel A: Full Sample (330 bank-years) 
Variable _Mean_ Std. Dev. Minimum 
R 0.248 0.404 —0.899 
CFO 0.080 0.438 —1.816 
FCFO3* 0.609 0.955 —2.911 
АСС 0.016 0.413 —1.929 
АТА 0.028 0.220 —1.025 
AHFS* 0.069 0.339 —1.646 
OCFO 0.177 0.246 —1.068 
OACC —0.081 0.189 —0.793 
CFI —0.517 1.082 —4.288 
CFF 0.443 1.137 —3.427 
ТК° . 0.038 0.070 —0.069 
Trading% 0.031 0.052 0.000 
Total assets 76,724 143,142 252.03 
Panel B: Dealer Banks (73 bank-years) 
R 0.267 0.363 —0.450 
CFO 0.149 0.349 —0.777 
FCFO3? 1.026 1.186 — 1.058 
ACC —0.025 0.339 —1.819 
ATA 0.083 0.314 -1.025 
AHFS* 0.033 0.149 —0.607 
OCFO 0.265 0.262 —0.155 
OACC —0.141 0.227 —0.793 
CFI —0.151 0.960 -2.172 
СЕЕ 0.012 0.968 -3.427 
TR 0.082 0.101 0.008 
Trading% 0.090 0.077 0.002 
Total assets 263,564 216,569 39,426 
Panel C: Non-Dealer Banks (257 bank-years) 
R 0.242 0.415 —0.899 
CFO 0.060 0.459 —1.816 
ЕСЕОЗЯ 0.476 0.829 —2.911 
ACC 0.028 0.431 —1.929 
ATA 0.013 0.182 —1.025 
AHFS 0.079 0.375 —1.646 
OCFO 0.152 0.236 -1.068 
ОАСС -0.064 0.173 -0.793 
СЕ -0.620 1.094 -4.288 
СЕЕ 0.565 1.153 —3.427 
TR 0.023 0.048 —0.069 
Trading % 0.017 0.030 0.000 
Total assets 31,703 61,706 252 
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Median Maximum 
0.210 1.531 
0.105 1.929 
0.447 4.257 

—0.015 2.659 
0.003 1.108 
0.000 1.925 
0.122 1.387 

—0.039 0.628 

—0.378 3.232 
0.299 4.368 
0.012 0.375 
0.013 0.328 

19,518 770,912 
0.238 1.463 
0.139 1.929 
0.649 4.257 

—0.018 0.680 
0.018 1.108 
0.000 0.774 
0.191 1.387 

—0.089 0.184 

—0.288 3.232 
0.106 2.018 
0.039 0.375 
0.077 0.328 

200,500 710,912 
0.205 1.531 
0.094 1.929 
0.419 2.969 

—0.012 2.659 
0.002 1.108 
0.000 1.925 
0.117 1.382 

—0.031 0.628 

-0,419 3.232 
0.377 4.368 
0.007 0.262 
0.010 0.265 

11,857 387,798 


(continued on next page) 
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* R denotes share return for the 12 months ending on April 30 following the fiscal year-end; CFO denotes cash 
flow from operations; FCFO3 denotes the sum of the next three years’ CFO; ACC denotes total accruals, i.e., 
the difference between net income and CFO; ATA denotes the change in net trading assets; AHFS denotes the 
change in held-for-sale loans; OCFO denotes CFO + ATA + AHFS; OACC denotes ACC — ATA — AHFS; 
CFI denotes cash flow from investing; CFF denotes cash flow from financing; TR denotes trading revenue; 
Trading% denotes the ratio of trading assets to total assets; and Total Assets are in millions. All variables other 
than R, Trading%, and Total Assets are deflated by beginning-of-year market value of equity. 

> All variables, except for Trading% and Total Assets, are winsorized at three standard deviations from their 
means. 

° For bank-years that do not report AHFS on the Statement of Cash Flows, AHFS is assumed to be zero. The 
number of non-zero observations for AHFS is 189, 21, and 168 for Panels A, B, and C, respectively. 

* The number of observations for FCFO3 is 226, 55, and 171 for Panels A, B, and C, respectively. 

° The number of observations for TR is 260, 66, and 194 for Panels A, B, and C, respectively. 





Spearman = .046) than with OACC (Pearson = —.076; Spearman = —.098). These differ- 
ential Pearson correlations are driven by the significant positive Pearson correlation between 
К and ATA (.131), because the Pearson correlation between R and AHFS is insignificant. 
In contrast, these differential Spearman correlations are driven by the significant positive 
Spearman correlation between R and AHFS (.138), because the Spearman correlation be- 
tween R and ATA is insignificant. OCFO is positively correlated with both ATA (Pearson 
= ,159; Spearman = .161) and AHFS (Pearson = .083; Spearman = .088), consistent with 
banks that generate OCFO investing a portion of that cash flow in net trading assets and 
HFS loans. CFI is negatively correlated with ATA (Pearson = -.109; Spearman = —.022), 
and CFF is positively correlated with both ATA (Pearson = .222; Spearman = .153) and 
AHFS (Pearson = .245; Spearman = .164), consistent with net trading assets displacing 
other investments because they are financed through the same sources. CFI and CFF are 
highly negatively correlated (Pearson = —.867; Spearman = —.828), consistent with fi- 
nancing inflows being used to fund investing outflows." 

The relatively high correlations among the variables imply that it is necessary to have 
a fairly large sample of observations in order to generate statistically powerful tests in our 
regression analyses.'* Given our small number of banks, this requires pooling observations 
across a sufficient number of years. Since returns are relatively uncorrelated through time 
(e.g., the first-order autocorrelation equals .02), this pooling should not yield serially cor- 
related residuals or overstated significance levels in our returns analysis, and in specification 
analysis we find that this is not the case. We allow for first-order autocorrelation of residuals 
in our future CFO analysis. 


11 These correlations differ from those for nonfinancial firms reported in prior research (e.g., Barth et al. 2001). 
To illustrate these differences, we estimated the correlations of CFO, ACC, CFI, and CFF from for all the firms 
on the Compustat industrial tape over our sample period 1991-2003, winsorizing this data at three standard 
deviations as we do in the paper. We find that the correlation between СЕС and ACC is —24 and the correlation 
between CFI and CFF is —.53 for this broader sample of firms, i.e., considerably less negative than for our 
sample. We believe the more negative correlation between CFO and ACC for banks reflect the monetary nature 
of banks’ assets and liabilities, which renders the distinction between cash and accruals less meaningful, since 
accruals are often very close to cash. We believe the more negative correlation between CFI and CFF for banks 
reflects their matching of the magnitude of financial assets (investments) and liabilities (financing). On the other 
hand, we find the correlation between CFO and CFI is —.25 and the correlation between CFO and CFF is —.43 
for this broader sample of firms, i.e., considerably more negative than for сніг banking sample. We believe these 
more negative correlations reflect nonfinancial firms compensating more strongly for lower CFO by either raising 
funds externally or reducing investments, and vice versa. 

12 While our variables are highly correlated, none of the variance inflation factors on the variables in our regression 
equations exceeds 10, the usual threshold for concern about multicollinearity. 
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V. PRIMARY RETURNS AND FUTURE CFO ANALYSIS 
Returns Analysis 

In this section we estimate and compare the market pricing of the sample banks’ cash 
flow and accrual components. We regress share returns for the 12 months ending on April 
30 following the fiscal year on these components aggregated in three different ways. First, 
as a benchmark and to articulate with prior research by Livnat and Zarowin (1990), we 
estimate a relatively aggregated model with CFO, ACC, and the non-operating components 
CFI and CFF as explanatory variables. Second, we estimate a disaggregated version of the 
first model in which the explanatory variables CFO and ACC are broken into their operating 
components, OCFO and OACC, and their hybrid operating and non-operating components, 
ATA and AHFS. Third, we estimate a reaggregated version of the second model in which 
the operating components are replaced with the single explanatory variable OP = OCFO 
+ OACC and the non-operating components are replaced with the single explanatory vari- 
able NONOP = CFI + CFF. 

The third model is attractive because it corresponds in the most direct fashion to our 
hypotheses and because by summing CFI and CFF it eliminates the strongest source of 
multi-collinearity among the explanatory variables. For this reason, we tabulate and discuss 
in detail our hypotheses tests regarding differences between the coefficients on changes in 
trading assets and HFS loans and the other components of cash flows and accruals only 
using the third model, although we briefly discuss the untabulated results for the other 
models as well. Our inferences are the same if we use the second model, however, because 
as discussed below we find insignificant differences between the coefficients on OCFO and 
OACC and between the coefficients on CFI and CFF in that model. 

Specifically, our first returns model, similar to Livnat and Zarowin’s (1990) Equation 
(3), is:? 


R, = а + b,ACC, + b,CFO, + b,CFI, + b,CFF, + e, (3) 


In this and subsequent models, the explanatory variables are all deflated by the beginning 
of fiscal year market value of equity, consistent with prior research using returns models 
(e.g., Christie 1987). Under Hypothesis H, we expect relatively high positive coefficients 
on the primarily operating components ACC and CFO and relatively small positive or 
negative coefficients on the non-operating components CFI and CFF. Restating Hypothesis 
H as restrictions on the coefficients in Equation (3): 


H-Equation (3): b, = b, > b, = Б, = 0. 


Hypothesis H-Equation (3) is consistent with Livnat and Zarowin's (1990) results for their 
sample of nonfinancial firms. 

Table 4 reports the estimations of the primary returns and future CFO models. As 
mentioned earlier, all regression variables are winsorized at three standard deviations from 


13 Livnat and Zarowin's (1990) explanatory variables in their main regression model are first-differenced. In con- 
trast, our explanatory variables are in levels form, because this appears to yield measures closer to the unexpected 
portions of those variables for our sample. In particular, the levels of the variables have higher associations with 
returns both individually and collectively (i.e., in terms of model R?). We attribute the superior power of a levels 
model in our setting in part to our decomposition of CFO into OCFO + ATA + AHFS. If we used a first- 
differenced model, then the second differences A(A7A) and A(AHFS) would be the explanatory variables. These 
second differences do not correspond well to the unexpected portions of these variables, presumably due to 
over-differencing. In any event, our results for Equation (3) are quite consistent with those of Livnat and Zarowin 
(1990) for their returns model, as discussed below, supporting our specification. 
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Regressions of Returns and Future Operating Cash Flows on Cash Flow 
and Accrual Components*^ 


Panel A: Estimation 











R, = a + b,ACC, + b,CFO, + b,CFI, + b,CFF, + е, (3) 
В, = a + b,OACC, + b,OCFO, + b,ATA, + b,AHFS, + b,CFI, + b,CFF, + е, (4) 
К, = а + ЂОР, + ЂАТА, + b,AHFS, + b,NONOP, + e, (5) 
FCFO3, = а + b,ACC, + Б. СРО, + b,CFI, + b,CFF, + е, (6) 
ЕСЕОЗ, = а + b,OACC, + b,OCFO, + b,ATA, + b,AHFS, + b,CFI, + b,CFF, + e, (7) 
ЕСЕОЗ, = a + БОР, + b,ATA, + b,AHFS, + b,NONOP, + e, (8) 
Returns Future Cash Flows* 
Model (3) _ (4) (5) (6) (7) (8) 
Intercept 0.175*** — 0.170*** — 0.172*** 00159999 — 0365*** | 0371*** 
(6.4): (6.0) (6.4) (4.2) (3.5) (3.1) 
ACC 0.611*** 2.324*** 
(3.3) (5.1) 
СЕО 0.510*** 1.864*** 
(2.9) (4.4) 
OACC 0.632*** 1.229** 
| (2.8) (2.1) 
OCFO | 0.567*** 2.037*** 
(3.2) (4Л) 
ОР 0.616*** 1.980*** 
(3.7) (4.7) 
ATA 0.324*** 0.321*** 0.854*** 0.924*** 
(2.9) (3.2) (4.2) (4.9) 
AHFS 0.027 0.021 0.388** 0.486*** 
(0.3) (0.3) Q.1) (2.7) 
CFI —0.127** —0.114** —0.039 0.016 
(-2.4) (-22) (-0.4) (0.2) 
CFF —0.098* —0,092* —0.157* —0.135 
(—1.9) (—1.8) (—17) (—1.5) 
NONOP —0.089* —0.108 
(—1.9) (—1.4) 
Adjusted R? 0.055 0.068 0.070 — — — 
Likelihood — — — 
Ratio Test (x?) 61.33 35.36 57.62 
# Observations 330 330 330 226 226 226 
Panel B: t-tests of Differences of Coefficients in Equation (5) 
TA HFS NONOP 
ОР 1.50* 3.00*** 4.37*** 
ТА 2.55*** 3.34*** 
HFS 1.00 


(continued on next page) 
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TABLE 4 (Continued) 
Panel C: t-tests of Differences of Coefficients in Equation (8) 








TA НЕ5 NONOP 
OP 2.48*** 3.12*** 4.92*** 
ТА 1.90** 4.56*** 
HFS 2.59%" 


* R denotes share return for the 12 months ending on April 30 following the fiscal year-end; CFO denotes cash 
flow from operations; FCFO3 denotes the sum of the next three years” CFO; ACC denotes total accruals, 1.е., 
the difference between net income and CFO; ATA denotes the change in net trading assets; AHFS denotes the 
change in held-for-sale loans; OCFO denotes CFO + ATA + AHFS; QACC denotes ACC — ATA — AHFS; OP 
denotes OCFO + OACC; CFI denotes cash flow from investing; CFF denotes cash flow from financing; 
NONOP denotes CFI + CFF. All variables other than R, are deflated by beginning-of-year market value of 


equity. 

b жж, ** and * indicate statistical significance at 1 percent, 5 percent, and 10 percent, respectively, in а one- 
tailed test if directional predictions are made and a two-tailed test otherwise. 

° To mitigate serial dependence of residuals, the estimation of the future cash flow models includes fixed effects 
for each bank (untabulated) and allows each bank’s residuals to follow the same AR(1) process. 


their means. In addition, Cook’s D test is used to check for any additional outliers in each 
model. No outlier is identified in the returns regressions, but one outlier is identified in the 
future CFO regressions and deleted. 

The estimation of Equation (3) is reported in the first column of Panel A. Consistent 
with Hypothesis H-Equation (3), the coefficients on the primarily operating components are 
significantly and similarly positive, with coefficients of .510 (t = 2.9) on CFO and .611 (t 
= 3.3) on ACC, and with these coefficients being insignificantly different from each other. '* 
In contrast, but also as hypothesized, the coefficients on the non-operating components are 
significantly and similarly negative, with coefficients of —.127 (t = —2.4) on CFI and 
—.098 (t — —1.9) on CFF, and with these coefficients being insignificantly different from 
each other. As hypothesized, the coefficients on each of CFO and ACC are significantly 
more positive than the coefficients on each of CFI and CFF. Thus, returns have a relatively 
strong positive association with the primarily operating components CFO and ACC and a 
relatively weak association with the non-operating components CFI and CFF. These results 
are qualitatively similar to those of Livnat and Zarowin (1990) for nonfinancial firms. 

Our second, disaggregated returns model is: 


К, = a + bOACC, + b,OCFO, + b,ATA, 
+ b,AHFS, + b,CFI, + b.CFF, + е, (4) 


Under Hypothesis H, we expect relatively high positive coefficients on the operating com- 
ponents OACC and OCFO, a lower positive coefficient on ATA, an even lower positive 
coefficient on AHFS, and relatively small positive or negative coefficients on the non- 
operating components CFI and CFF. Restating Hypothesis Н as restrictions on the coeffi- 


cients in Equation (4): 


14 When our hypotheses about the signs of and differences between coefficients are directional, as is usually the 
case, we say a coefficient or difference of coefficients is significant if the significance level is 5 percent or better 
in a one-tailed test (i.e., t = 1.65). Otherwise, we evaluate significance at the 5 percent level in a two-tailed test 
(i.e., t = 1.96). 
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H-Equation (4): b, = b, > b, > b, > b, = Б, = 0. 


The second column of Table 4, Panel A reports the results of estimating Equation (4). 
Consistent with Hypothesis H-Equation (4), the coefficients on the operating components 
are significantly and similarly positive, with a coefficient of .567 (t = 3.2) on OCFO and 
a coefficient of .632 (t = 2.8) on OACC, and with these coefficients being insignificantly 
different from each other. In contrast, but also as hypothesized, the coefficients on the non- 
operating components are significantly and similarly negative, with coefficients of —.114 
(t = —2.2) on CFI and —.092 (t = —1.8) on CFF, and with these coefficients being 
insignificantly different from each other. The coefficients on OCFO and OACC are both 
significantly more positive than the coefficients on each of CFI and CFF. The coefficient 
on the hybrid operating/non-operating component ATA is significantly positive at .324 (t 
— 2.9). As hypothesized, this coefficient is significantly lower than the coefficients on the 
operating components OCFO and OACC, and it is significantly higher than the coefficients 
on the hybrid operating/non-operating component AHFS and on the non-operating com- 
ponents СЕТ and CFF. The coefficient on the hybrid operating/non-operating component 
AHFS is insignificantly positive at .027. As hypothesized, this coefficient is significantly 
lower than the coefficients on the operating components OCFO and OACC and on the 
hybrid operating/non-operating component АТА, but it is insignificantly higher than the 
coefficients on the non-operating components CFI and CFF. 

Our third, reaggregated returns model is: 


R, = а + БОР, + БАТА, + b,AHFS, + b,NONOP, + е, (5) 


Under Hypothesis H, we expect relatively high positive coefficients on OP, a lower positive 
coefficient on ATA, an even lower positive coefficient оп AHFS, and a relatively small 
positive or negative coefficient on NONOP. Restating Hypothesis H as restrictions on the 
coefficients in Equation (5): 


H-Equation (5): b, > b, > b, > 5, ~ 0. 


The third column of Table 4, Panel A reports the results of estimating Equation (5). 
Student t-tests of the differences between the coefficients are reported in Table 4, Panel B. 
Equation (5) has a higher adjusted R? than Equation (4) and so is the best specified of the 
returns models. 

Consistent with Hypothesis H-Equation (5), the coefficient on OP is significantly pos- 
itive at .616 (t = 3.7). The coefficient on NONOP is significantly negative at —.089 (t 
= –1.9). The coefficient on the hybrid operating/non-operating component АТА is sig- 
nificantly positive at .321 (t = 3.2). As hypothesized, this coefficient is significantly lower 
than the coefficient on OP at the 10 percent level (t — 1.5), and it is significantly 
higher than the coefficients on AHFS (t = 2.6) and NONOP (t = 3.3). The coefficient on 
the hybrid operating/non-operating component AHFS is insignificantly positive. As hy- 
pothesized, this coefficient is significantly lower than the coefficients on OP (t — 3.0) and 
ATA (t — 2.6), and it is higher than the coefficient on NONOP, though insignificantly so. 

In summary, all of the differences of coefficients in Equation (5) have the sign predicted 
under Hypothesis H, and all the coefficient differences are significant at the 5 percent level 
except for difference of the coefficients on OP versus ATA, which is significant at the 10 
percent level, and the difference of the coefficients on AHFS versus NONOP, which is 
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insignificant. Given our maintained assumption that operating activities have stronger val- 
uation implications than non-operating activities, these results imply that the market prices 
OP and ATA as significantly more operating than AHFS and NONOP. The marginally 
significant coefficient difference for OP versus ATA suggests that the market prices hybrid 
trading activities as having a predominantly operating nature, while the insignificant coef- 
ficient difference for AHFS versus NONOP suggests the market price the hybrid activity 
of originating and selling HFS loans as having a predominently non-operating nature. 


Future CFO Analysis 


As in prior research by Wahlen (1994), Dechow et al. (1998), Barth et al. (2001), and 
Kothari (2001), in this section we attempt to corroborate and explain our prior returns 
results in terms of the (fundamental) association of the change in net trading assets and 
other components of cash flows and accruals with future CFO. This analysis should be 
viewed as complementary with, not subordinate to, the returns analysis; the future CFO 
analysis indicates whether the components of cash flow and accruals are economically 
distinct, while the returns analysis indicates whether the marxet perceives these components 
to be economically distinct. Moreover, the future CFO analysis is of interest, independent 
of the returns analysis because SFAC No. 1 (FASB 1978, para. 37) states that a primary 
goal of financial reporting is to “provide information to help investors, creditors, and others 
assess the amounts, timing, and uncertainty of prospective net cash inflows to the related 
enterprise." 

We measure future CFO in year 1 as the sum of CFO for the years t+1 to t+3 deflated 
by market value at the beginning of fiscal year г, denoted FCFO3,.5 We measure future 
CFO over a period longer than a year both because returns should reflect the changes in 
the whole vector of future cash flows and because growth and other factors render the 
association between the cash flow and accrual components and future CFO in any individual 
year very noisy. As reported in Table 3, the correlation of FCFO3 with R is .182, significant 
at the 5 percent level. 

We regress FCFO3 on the same three sets of explanatory variables as in the returns 
regression Equations (3)-(5): 


FCFO3, = a + b,ACC, + b,CFO, + b,CFI, + b,CFF, + е, (6) 
FCFO3, = a + b,OACC, + Ъ,ОСЕО, + b,ATA, + b,AHFS, 

+ b,CFI, + b,CFF, + е, (7) 
FCFO3, = a + БОР, + b,ATA, + b,AHFS, ~ b,NONOP, + e,. (8) 


Consistent with the returns regressions, we deflate all the variables in Equations (6)-(8) by 
beginning of year ¢ market value of equity. In untabulated specification analysis, we find 
that our results are substantively unaffected if we deflate these regressions by average total 
assets instead, as in Barth et al. (2001). Restating Hypothesis H as restrictions on the 
coefficients in Equations (6)-(8): 


15 To ensure that the time value of money does not affect our results, we conducted a specification test using as 
the dependent variable the present value of CFO in years t+1 thorough t3 discounting at a constant 10 percent 
annual rate. Other than the largely mechanical effect of decreasing the absolute magnitudes of our coefficients, 
our results are not affected by this specification test. 
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H-Equation (6): b, = b, > b, > b, > b; = bs = 0, 
-` H-Equation (7): b, ~ b, > b,» b, > b, = b, = 0, and 
H-Equation (8): b, > b, > b} > Ы, = 0. 


As for the returns analysis, we tabulate and discuss in detail our tests.only for the reag- 
gregated model reflected in Hypothesis H-Equation (8), although we briefly discuss the 
untabulated results for the other models as well. 

Because annual CFO is serially correlated and FCFO3 is measured over overlapping 
three-year periods, the residuals from OLS estimations of Equations (6)—(8) are significantly 
positively serially correlated; for example, the first-order autocorrelation of the pooled OLS 
residual in Equation (8) is .65 (x? = 58.6). To address this issue, we estimate these equations 
using maximum likelihood, allowing the residuals for each firm to follow the same AR(1) 
process.!6 

The results of all the future CFO regressions are reported in Table 4, Panel A, with 
Equation (6) represented in the fourth column. Consistent with Hypothesis H-Equation (6), 
the coefficients on CFO and ACC are significantly positive at 1.864 (t — 4.4) and 2.324 (t 
— 5.1), respectively, with the two coefficients being insignificantly different. In contrast, 
.but also as hypothesized, the coefficient on Ше non-operating component CFT is insignifi- 
cantly negative, and the coefficient on the non-operating component CFF is significantly 
negative at the 10 percent level (—.135, t = —1.5); these two coefficients are significantly 
different from each other in large part due to their high negative correlation. As hypothe- 
sized, the coefficients on CFO and ACC are each significantly more positive than the 
coefficients on each of CFI and CFF. Thus, future CFO has a relatively strong positive asso- 
ciation with the primarily operating components CFO and ACC and a relatively weak 
association with the non-operating components CFI and CFF. 

The fifth column of Table 4, Panel A reports the estimation of Equation (7). Consistent 
with Hypothesis H-Equation (7), the coefficients on the operating components are signifi- 
cantly positive, with a coefficient on OCFO of 2.037 (t — 4.7) and a coefficient on OACC 
of 1.229 (t — 2.1), with these two coefficients being significantly different. The coefficients 
on the non-operating components CFI and CFF are insignificantly different from zero; these 
two coefficients are again significantly different from each other. As hypothesized, each of 
the coefficients on OCFO and OACC is significantly more positive than the coefficients on 
each of CFI and CFF. The coefficient on the hybrid operating/non-operating component 
ATA is significantly positive at .854 (t — 4.2). As hypothesized, this coefficient is signifi- 
cantly lower than the coefficients on OCFO, though unexpectedly it is insignificantly lower 
than the coefficient on OACC. As hypothesized, the coefficient on ATA is significantly 
higher than the coefficients on the hybrid operating/non-operating component AHFS and 


16 The actual time-series of the residuals in Equations (6)-(8) and subsequent future CFO models is more com- 
plicated than the AR(1) process we assume in the maximum likelihood estimation, because the measurement of 
FCFO3 over overlapping three-year periods introduces a MA(2) process into the time-series. Given our relatively 
small sample size, allowing for such a complicated time-series process for the residuals would yield inefficient 
estimation of the variance-covariance matrix. For this reason, we assume the simpler AR(1) process, and use 
the appropriate powers of the estimated first-order autocorrelation parameter of .65 as the higher-order autocor- 
relations of the residuals. This approach appears to work well. For example, in the OLS estimation of Equation 
(8), the estimated second-order autocorrelation of the residuals is .47 while the AR(1) process implies this 
autocorrelation is .42 — .65?, and the estimated third-order autocorrelation is .17 while the AR(1) process implies 
this autocorrelation is .27 = .65?. 
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the non-operating components CFI and CFI. The coefficient on the hybrid operating/non- 
operating component AHFS is significantly positive at .388 (t = 2.1). As hypothesized, it 
is significantly less than the coefficients on the operating components OCFO and OACC 
and on the hybrid operating/non-operating component ATA, and it is significantly higher 
than the coefficients on the non-operating components CF7 (at the 10 percent level) and 
CFF. 

The sixth column of Table 4, Panel A reports the results of estimating Equation (8). 
Student t-tests of the differences between the coefficients are reported in Table 4, Panel C. 
Consistent with Hypothesis H-Equation (8), the coefficient on OP is significantly positive 
at 1.980 (t = 4.7) and the coefficient on NONOP (-.108) is insignificantly negative. The 
coefficient on the hybrid operating/non-operating component ATA is significantly positive 
at .924 (t = 4.9). As hypothesized, the coefficient on ATA is significantly lower than the 
coefficient on OP (t = 2.5), and significantly higher than the coefficients on AHFS (t 
= 1.9) and NONOP (t = 4.6). The coefficient on the hybrid operating/non-operating com- 
ponent AHF’ is significantly positive at .486 (t = 2.7). As hypothesized, the coefficient on 
AHFS is significantly lower than the coefficients on OP (t = 3.1) and ATA (t = 1.9), and 
it is significantly higher than the coefficient on NONOP (t = 2.6). 

In summary, in the future CFO regression Equation (8), the coefficient differences all 
have the predicted sign and are significant. Specifically, we find that future CFO has rela- 
tively high positive associations with the operating components of cash flow and accruals, 
a lower positive association with ATA, an even lower positive association with AHFS, and 
insignificant associations with the non-operating components of cash flow. 

Summarizing the results of the returns and future CFO analyses, all the coefficient 
differences in both analyses have the correct signs under Hypothesis H, consistent with our 
argument that trading and originating and selling HFS loans constitute hybrid operating/ 
non-operating activities. Moreover, the results of estimating the returns regression Equation 
(5) and the future CFO regression Equation (8) substantially correspond, consistent with 
the market being aware, at least to some extent, of the hybrid economic natures of trading 
activities and HFS loans. The primary disparity between the two analyses is the coefficient 
differences for OP versus ATA and for AHFS versus NONOP are significant in the future 
CFO analysis but are marginally significant or insignificant in the returns analysis. A pos- 
sible explanation for this disparity is that investors put excess weight on the more predom- 
inant aspect of the nature of a hybrid activity, 1.е., operating for trading activities and non- 
operating for HFS loan activities. 


VI. SPECIFICATION ANALYSES 

Unrealized Gain versus Principal Cash Flows 

As discussed above, ATA equals the unrealized gain plus the principal cash outflow for 
trading positions during the period. We hypothesize that the unrealized gain is more op- 
erating than the principal cash flow, and so it should have a more positive association with 
returns and future cash flows. Because we cannot observe either the unrealized gain or the 
principal cash flows directly," we use trading revenue as proxy for the unrealized gain. 
Trading revenue is observable for 260 (79 percent) of the 330 sample observations on either 


17 Unrealized gains and losses are not observable due to banks’ aggregation of unrealized gains and losses with 
realized gains and losses and other source of non-interest income. The principal cash flows on trading positions 
are not directly observable due to the classification of these cash flows as operating and banks' universal use of 
the indirect method of reporting cash flow from operations. 
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Bank Compustat or the Bank Regulatory Y-9C filings.'* Trading revenue from these data 
sources includes realized gains and may include fee income associated with trading; because 
these additional components of trading revenues are operating, their inclusion does not alter 
the hypothesis stated above. In addition, banks may differ in the extent to which trading 
revenue includes fee income. For these reasons, trading revenue is a heterogeneously noisy 
measure of unrealized gains, likely weakening our results. 

Given the relatively small size of our sample, to avoid losing the observations for which 
trading revenue is not available we set TR equal to trading revenue if it is available and to 
zero otherwise, and add it as an additional explanatory variable in Equations (5) and (8): 


R, = a + ,0, + b,ATA, + b,AHFS, + b,NONOP, + b,TR, + e, (5-ТЁ) 
FCFO3, = а + БОР, + b,ATA, + b,AHFS, + b,NONOP, + b,TR, + e, (8-ТВ) 


In Equations (5-TR) and (8-TR), the coefficients Б. on TR capture the additional valuation 
implications of trading revenue, when disclosed, beyond ATA, which includes both the 
unrealized gain and principal cash flow.'? We hypothesize that TR has incremental positive 
pricing implications beyond ATA that are captured in positive coefficients Б. in Equations 
(5-ТЕ) and (8-TR): 


Table 5 reports the results of estimating Equations (5-TR) and (8-TR). In Equation (5- 
TR), the coefficient on 7R is significantly positive at .723 (t — 2.0), consistent with Hy- 
pothesis H-TR that the market prices trading revenue as having a more operating nature 
than the remainder of ATA. In Equation (8-TR), the coefficient on TR is positive and 
relatively large (.929 compared to a coefficient of .979 on ATA), but not significant. The 
significant coefficient in TR in the returns analysis but insignificant coefficient on TR in the 
future CFO analysis could be due either to the market overvaluing TR or to the measurement 
error in TR yielding more attenuation bias in the future CFO analysis than in the returns 
analysis. 


Dealer versus Non-Dealer Banks 

As discussed above, our sample contains two distinct subsamples: six derivatives deal- 
ers with both trading assets and trading liabilities and 31 non-dealer banks with trading 
assets only. To determine whether these subsamples are heterogeneous, we estimate the 
reaggregated returns and future CFO Equations (5) and (8) allowing the coefficients to 


1$ The Bank Compustat Annual file (item #120) provides trading revenue for 178 of our observations. Bank 
Compustat reports trading revenue as it appears in banks’ annual financial reports, which likely includes all 
trading gains and losses and most trading-related fee income, and so it is our preferred source for trading 
revenue, The bank regulatory Y-9C filings (item #BHCKA220) provide trading revenues for 82 additional ob- 
servations. The Y-9C filings report trading revenue defined as trading gains and losses plus “incidental income 
and expense related to the purchase and sale of” trading assets and liabilities; this likely constitutes a subset of 
fee income streams. For the 89 observations for which trading revenue is observable both on Bank Compustat 
and in the Y-9C filings, the correlation of the two measures of trading revenue is .80. 

19 We also estimated variants of Equations (5-TR) and (8-TR) with the net principal cash outflow measured as TR 
— ATA in place of ATA for the 260-observations sample for which trading revenue is observable. These esti- 
mations yield results similar to those reported in Table 5, significantly positive coefficients on 7R and insignif- 
icant coefficients on TR — ATA. 
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TABLE 5 
Regressions of Returns and Future Cash Flows on Reaggregated Cash Flow and Accrual 
Components and Trading Revenues*^ 


К, = a + БОР, + b,ATA, + b,AHFS, + b,NONOP, + БАТЕ + e, (5-TR) 
ЕСЕОЗ, = a + БОР, + b,ATA, + b,AHFS, + bsNONOP, + Ъ.ТК, + e, (8-TR) 
Returns Future Cash Flows‘ 
Intercept 0.160*** 0.326*** 
(5.8) (2.6) 
OP 0.556*** 2.058*** 
(3.3) (4.9) 
ATA 0.237** 0.979*** 
(2.2) (5.0) 
AHFS 0.004 0.523*** 
(0.1) (3.0) 
NONOP —0.081* —0.153* 
(71.7) (-1.8) 
TR 0.723** 0.929 
(2.0) (.8) 
Adj. R? 0.078 — 
Likelihood Ratio Test (x?) — 57.08 
3t Observations 330 226 


* R denotes share return for the 12 months ending on April 30 following the fiscal year-end; CFO denotes cash 
flow from operations; FCFO3 denotes the sum of the next three years’ CFO; ACC denotes total accruals, i.e., 
the difference between net income and CFO; ATA denotes the change in net trading assets; AHFS denotes the 
change in held-for-sale loans; OCFO denotes CFO + ATA + AHFS; OACC denotes ACC — ATA — AHFS; OP 
denotes OCFO + OACC; CFI denotes cash flow from investing; CFF denotes cash flow from financing; 
NONOP denotes CFI + CFF; TR denotes trading revenue. All variables other than R, are deflated by 
beginning-of-year market value of equity. 

bebe же, and * indicate statistical significance at 1 percent, 5 percent, and 10 percent, respectively, in a one- 
tailed test if directional predictions are made and a two-tailed test otherwise. 

* To mitigate serial dependence of residuals, the estimation of the future cash flow models includes fixed effects 
for each bank (untabulated) and allows each bank's residuals to follow the same AR(1) process. 


differ across the two subsamples.” Specifically, we define a dummy variable, D, equal to 
1 for the six dealers, and 0 otherwise. We interact D with the intercept and each of the 
independent variables in those equations: 


К, = а + арр + БОР, + b,ATA, + b,AHFS, + b,NONOP, 

+ b,D*OP, + bgD*ATA, + b,D*AHFS, + bD*NONOP, + e, (5-р) 
FCFO3, = a + aD + БОР, + b,ATA, + b,AHFS, + b,NONOP, 

+ bD*OP, + b,D*ATA, + b,D*AHFS, + b,D*NONOP, + е. (8-0) 


In these regressions, the coefficients on the uninteracted variables represent the sensitivities 
for the 31 non-dealer banks; the sums of the coefficients on the uninteracted and interacted 





20 The results of estimating the aggregated Equations (3) and (6) and the disaggregated Equations (4) and (7), 
distinguishing dealer and non-dealer banks, yield consistent differences across the dealer and non-dealer bank 
groups as those for the reaggregated Equations (5-D) and (8-D) reported in Table 6. 
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variables represent the sensitivities for the six dealers. The coefficients on the inter- 
acted variables represent the differences in slopes between the dealers and non-dealer banks. 

As a consequence of dealers’ focus on fee-generation, we expect the dealers to generate 
more positive net present value fee streams from their operations than do non-dealer banks. 
Because most of these fee streams (including those for trading) will be reflected in either 
OCFO or OACC, our primary hypothesis is that the coefficient on OP is higher for dealers 
than for non-dealer banks. Because dealers match their trading assets and trading liabilities, 
thereby diminishing the magnitude of net principal cash flows and thus the non-operating 
aspects of trading, we also expect a higher coefficient on ATA for dealers than non-dealer 
banks. Finally, loan origination and other fees for HFS loans that must be deferred under 
SFAS No. 91 (FASB 1986) will be reflected in AHFS; if dealer banks have higher fees of 
this type, then there should also be a higher coefficient on AHFS for dealers than for non- 
dealer banks. Stating these hypotheses as restrictions on the coefficients in Equation (5-D) 
and (8-D): 


H-D-OP: b, > 0. 
H-D-TA: b, > 0. 
H-D-HES: b, > 0. 


Table 6 reports the results of estimating Equations (5-D) and (8-D). As predicted in 
Hypothesis H-D-OP, the coefficients on D*OP of 1.488 (t = 2.7) in Equation (5-D) and 
2.487 (t = 2.0) in Equation (8-D) are both significantly positive, consistent with OP having 
greater valuation implications for the dealers than for the non-dealer banks. As predicted 
in Hypothesis H-D-HFS, the coefficients on D*AHFS of .783 (t = 2.6) in Equation (5-D) 
and 1.316 (t = 2.3) in Equation (8-D) are both significantly positive, consistent with AHF'S 
having greater valuation implications for the dealers than for the non-dealer banks. In 
contrast, the coefficient on the interacted variable D*ATA is insignificant in Equation (5- 
D) and significantly positive at the 10 percent level (.611; t = 1.3) in Equation (8-D), only 
partially and weakly consistent with Hypothesis H-D-TA that trading has a more operating 
character for dealers than for non-dealer banks. 

The consistency of the results across the returns and future cash flow equations in this 
section again suggests that the market has a general understanding of the different valuation 
implications of the various cash flow and accrual components. 


Other Specification Analyses 

We conducted the following additional specification analyses to ensure that our results 
are robust. First, banks merged or acquired each other continuously throughout the sample 
period. Pooling mergers yield restatements of historical financial statements, which along 
with restatements for any other reason constitute an issue when we collect prior years’ data 
from a given year’s financial report filed on EDGAR, as discussed in Section IV. To elim- 
inate significant mergers and acquisitions and restatements, we deleted bank-year obser- 
vations for which assets grew more than 20 percent during the year or for which annual 
earnings from Bank Compustat (originally reported) differs from the annual earnings from 
EDGAR (potentially restated). This eliminated 20 percent of the observations in the primary 
returns analyses. The results are similar for all analyses except for those involving TR 
reported in Table 5, for which the coefficient on TR becomes insignificant. 

Second, because the observations arise from a single industry, they are cross-correlated 
in a given year. In our view, the common industry effects have the same interest as the 
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TABLE 6 
Regressions of Returns and Future Cash Flows on Reaggregated Cash Flow and Accrual 
Components Distinguishing Dealer and Non-Dealer Banks? 


К, = а + врр + БОР, + b,ATA, + b,AHFS, + b,NONOP, 


+ b,D*OP, + b,D*ATA, + b,D*AHFS, + b,D*NONOP, + е, (5-D) 
FCFO3, = a + apD + БОР, + b,ATA, + b,AHFS, + bsNONOP, 
+ b,D*OP, + b.D*ATA, + b,D*AHFS, + b,D*NONOP, + e, (8-D) 
Returns Future Cash Flows‘ 
Intercept 0.192*** 0.363*** 
(6.7) (2.8) 
'OP 0.502*** 1.068** 
(2.8) (2.0) 
D —0227*** 0.132 
(-2.7) (0.4) 
ATA 0.319*** 0.273 
(2.4) (0.9) 
AHFS —0.023 0.359** 
(—0.3) (1.9) 
NONOP —0.074 —0.095 
(—1.5) (—1.1) 
D*OP 1.488*** 2.487** 
(2.7) (2.0) 
D*ATA —0.097 0.611* 
(-0.4) (1.3) 
D*AHFS 0.783** 1.316** 
(2.5) (2.3) 
D*NONOP —0.013 0.066 
(—0.1) (0.3) 
Adjusted R? 0.096 - 
Likelihood Ratio Test (x?) — 50.23 
# Observations 330 226 


а К denotes share return for the 12 months ending on April 30 following the fiscal year-end; СРО denotes cash 
flow from operations; FCFO3 denotes the sum of the next three years’ CFO; ACC denotes total accruals, i.e., 
the difference between net income and CFO; ATA denotes the change in net trading assets; AHFS denotes the 
change in held-for-sale loans; OCFO denotes CFO + ATA + AHFS; OACC denotes ACC — ATA — AHFS; OP 
denotes OCFO + OACC; CFI denotes cash flow from investing; СЕК denotes cash flow from financing; 
NONOP denotes CFI + CFF; D takes a value of 1 for dealer banks, and 0 for non-dealer banks. All variables 
other than К and D are deflated by beginning-of-year market value of equity. 

b***, ** and * indicate statistical significance at 1 percent, 5 percent, and 10 percent, respectively, in a one- 
tailed test if directional predictions are made and a two-tailed test otherwise. 

* To mitigate serial dependence of residuals, the estimation of the future cash flow models includes fixed effects 
for each bank (untabulated) and allows each bank's residuals to follow the same AR(1) process. 


firm-specific effects, so that removing those effects eliminates variation of interest. How- 
ever, we also estimated all models using fixed time effects and using a robust variance 
estimation approach that allows for correlation of residuals across clusters of observations, 
in this case the observations in a given year. The estimated coefficients have the hypothe- 
sized relationships in these analyses, although the t-statistics generally diminish by about 
a third. 
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VH. CONCLUSION 

In this paper we argue that banks' trading activities have a hybrid operating/non- 
operating nature that is not fully captured by the classification of the cash flows on these 
activities as operating during our sample period from 1991—2003 under SFAS No. 102 and 
related accounting practices. Reflecting this hybrid nature, we hypothesize that the change 
in net.trading assets bas a less positive association with returns and future CFO than do 
the pure operating components of cash flows and accruals and a more positive association 
with returns and CFO than do the pure non-operating components of cash flows. We find 
support for this hypothesis on a sample of U.S. banks holding an appreciable amount of 
trading assets. 

We conduct three additional empirical analyses to further investigate how the market 
prices hybrid operating /non-operating flows and how these flows relate to future CFO. First, 
we hypothesize and find that the market prices the change in net trading assets as more 
operating than the change in held-for-sale loans, another important hybrid but more 
investing-oriented activity. Second, for the observations for which we can observe the op- 
erating and non-operating components of the change in net trading assets, we hypothesize 
and find that returns and future cash flows are more positively associated with the trading 
revenue (operating) component than with the principal cash flow (non-operating) compo- 
nent. Third, to address potential heterogeneity in our sample and to examine whether the 
market differentiates firms based on the relative operating nature of their activities, we 
hypothesize and find that derivative dealers’ activities generally are more operating than are 
those of non-dealer banks, although the results for the change in net trading assets are only 
partially and weakly consistent with our hypotheses. 

Collectively, our market pricing and future cash flow results suggest that the market 
appreciates the hybrid operating/non-operating nature of trading activities and also of HFS 
loans and differentiates firms based on the relative operating nature of their activities. This 
appreciation likely reflects the fact that investors can observe the changes in trading posi- 
tions and HFS loans separately from other CFO and accruals, and so they can make ad- 
justments for these changes in their valuation analyses. Our results imply that it is important 
that the cash flows and accruals on hybrid activities, such as trading positions and HFS 
loans, be reported in a disaggregated fashion on the SCF or elsewhere in financial reports, 
regardless of whether these items are classified as operating. 

However, our results also: imply that any single classification of the cash flows on 
trading positions and HFS loans cannot logically capture the hybrid economic natures of 
these activities. Hence, additional disclosures regarding the natures and financial statement 
effects of these and other hybrid activities would be useful to investors and other users of 
financial reports. Specifically, it would be useful to require disclosure of the amoünts of 
principal cash flows and timing differences (unrealized gains and losses) associated with 
these activities; these amounts generally cannot be inferred from current disclosures of the 
net changes in these accounts. 

Our results contribute to the literature on the market pricing of the components of cash 
flow and accruals, which until now has primarily focused on nonfinancial firms. Our paper 
is the first to hypothesize and document empirically that the cash flow from operations of 
financial institutions and trading firms include components with hybrid operating/non- 
operating natures that are priced by the market as such. While we examine a relatively 
small sample of banks with significant trading operations, our results have more general 
importance because a broad set of financial institutions and trading firms have cash flows 
whose hybrid nature is not fully captured by their SCF classification. For example, lessors 
classify the cash flows on operating leases and insurers classify the cash flows on traditional 
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insurance contracts as operating, despite the partly non-operating natures of these activities. 
Financial institutions and trading firms play a large and increasing role in the overall econ- 
omy, and so the SCF classification issues we examine are likely to become more important 
over time. 

Our results are of current interest given the recent highly publicized cases in which 
Enron and other firms issued prepay liabilities that raised funds that were classified as 
operating cash inflows under prior accounting practices. To eliminate firms’ ability to use 
prepays to increase cash flow from operations, FASB required in SFAS No. 149 that the 
cash flows on derivative liabilities with other-than-insignificant initial value be classified as 
financing, and it is likely that this requirement will be applied to trading liabilities generally. 
While FASB’s motivation for SFAS No. 149's requirement is clear and understandable, this 
requirement is conceptually problematic, especially for trading firms with matched trading 
assets and liabilities, because it will cause the cash flows on these trading assets and trading 
liabilities to be classified inconsistently on the SCF. In particular, growing trading firms 
will have operating cash outflows for trading assets and financing cash inflows for trading 
liabilities. | 

We believe that FASB should consider remedying this inconsistency by requiring the 
cash flows on matched trading assets and liabilities to be classified in a consistent fashion 
on the SCF. There are two ways FASB could do this: (1) it could require the cash flows 
on matched trading assets and liabilities to be classified as operating (i.e., partly or wholly 
undo the requirement of SFAS No. 149 and return to prior accounting practices) or (2) it 
could require the (principal) cash flows on trading assets to be classified as investing (1.е., 
modify the requirements of SFAS No. 102) and on trading liabilities to be classified as 
financing (analogous to SFAS No. 149). As discussed above, our results imply that the 
market views the cash flows on trading positions as largely though not entirely operating, 
and so on this basis the first approach appears to be more economically descriptive. On the 
other hand, the second approach might be more robust to cases in which trading assets and 
liabilities are not matched. | 








APPENDIX 
Statement of Cash Flows 
M&T Bank Corporation for Fiscal Year 2000 
(in thousands) 
Cash flows from operating activities: 
Net income (NI) $ 286,156 
+ Provision for credit losses 38,000 
+ Depreciation and amortization of premises and equipment 30,164 
+ Amortization of capitalized servicing rights 24,392 
+ Amortization of goodwill and core deposit intangible 69,576 
+ Provision for deferred income taxes (5,911) 
+ Asset write-downs 1,674 
— Net gain on sales of assets (6,631) 
— Net change in accrued interest receivable (payable) 25,540 
— Net change in other accrued income (expense) (27,901) 
— Ordinary accruals (OACC) 148,903 
= Ordinary cash flow from operations (OCFO)* 435,059 
— Net change in trading assets and liabilities (ATA)* (6,868) 
— Net change in loans held for sale (AHFS)* (81,549) 
= Cash flow from operations (CFO) 346,642 
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Cash flows from investing activities: 

Proceeds from sales of investment securities 771,815 
Proceeds from maturities of investment securities 498,125 
Purchase of investment securities (458,421) 
Additions to capitalized servicing rights (33,694) 
Net increase in loans and leases (1,467,187) 
Capital expenditures, net ` (18,784) 
Acquisitions, net of cash acquired 169,912 
Purchase of bank owned life insurance (35,000) 
Other 13,183 
= Cash flow from Investing Activities (CFT) (560,051) 
Cash flows from financing activities: 

Net decrease in deposits (321,735) 
Net increase (decrease) in short-term borrowings (830,214) 
Proceeds from long-term borrowings 1,000,896 
Payments on long-term borrowings (32,224) 
Purchases of treasury stock (54,947) 
Dividends paid—common (51,987) 
Other 34,830 
= Cash flow from financing activities (CFF) (255,381) 


* Accruals (ACC) = OACC + ATA + AHFS = -$148,903 + $6,868 + $81,549 = —$60,468. 
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ABSTRACT: Research indicates that justifiability of a decision is a key determinant of 
work quality. In this paper, we investigate whether and how the requirement for pre- 
parers to justify an audit conclusion to a reviewer with task preferences that are similar 
to or dissimilar from their explicitly stated initlal task preferences determine the nature 
and extent of justifications documented in their workpapers, and how this effect is 
moderated by their technical and tacit managerial knowledge. Results of an experiment 
with auditors as participants Indicate that auditors use different forms of justifications 
under different circumstances. Auditors with high tacit managerial knowledge and high 
technical knowledge have a higher proclivity to enumerate more pro versus con rea- 
sons, and to consider a wider breadth of Issues when they are required to justify their 
conclusions to reviewers with dissimilar task preferences. In contrast, high tacit man- 
agerlal knowledge auditors required to justify to reviewers with similar Initial task pref- 
erences have a greater tendency to employ an evidence framing approach, in which 
language is used to emphasize consistent evidence by downplaying associated тсоп- 
sistent evidence. In additional analyses, we find that these three specific forms of jus- 
tifications are positively associated with evaluations of work quality. 


Keywords: justification; tacit managerial knowledge; review process. 
Data Avallability: Contact the authors. 


I. INTRODUCTION 
he need to justify a conclusion or point of view is a ubiquitous feature in many 
| business and professional contexts. In such settings, the justifier generally seeks to 
provide support for an expressed viewpoint, and to persuade recipients of its validity. 
For example, managers routinely justify their assessments of the company's past and future 
outlook, and may adopt persuasive strategies in justifying their assessments to investors and 
analysts (Sedor 2002; Kuperman 2003). Financial analysts also provide justifications for 
their research reports, and investors’ reliance on these reports is determined in part by the 
persuasiveness of these justifications (Hirst et al. 1995). 
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In this paper, we focus on justifications in an auditing context, where the quality of 
documented justifications is especially important. Auditing standards mandate that work- 
papers contain sufficient and appropriate justifications that support a documented conclusion 
(SAS No. 96, AICPA 2002). Furthermore, such justifications are referred to as a source of 
evidence by others such as judges, jurors, expert witnesses, and regulators in litigation cases 
(Kennedy et al. 1997; Beus 2003; Ricchiute 2004). In particular, for ambiguous unstructured 
audit tasks with no one single appropriate solution, justifiability or defensibility of the 
documented judgment or decision is a major measure of work quality (Waller and Felix 
1984; Emby and Gibbins 1988; Kennedy et al. 1997). 

Despite its importance, few studies examine the nature and determinants of justifications 
documented by workpaper preparers. Two studies are relevant. Koonce et al. (1995) and 
Tan et al. (1997) provide evidence suggesting that preparers increase the quantity of jus- 
tifications when they are made accountable to a reviewer with unknown preferences. How- 
ever, both papers document effects on ће quantity not the nature of preparers’ justifications, 
and preparers have been posited to vary or stylize the nature of their workpaper documen- 
tation as part of their reputation management strategies (Rich et al. 1997; Gibbins and 
Trotman 2002).! 

Current trends toward real-time reviews involve greater and earlier interactions among 
reviewers and preparers, where reviewers provide front-end guidance to preparers in the 
conduct of the audit (Rich et al. 1997; Gibbins and Trotman 2002). One concern arising 
from real-time reviews is that reviewers communicate their task preferences (that is, their 
preferred audit conclusions on the task) to workpaper preparers too early in the audit process 
(Rich et al. 1997; Wilks 2002). Prior studies suggest a cost to real-time reviews in that 
preparers required to justify to a reviewer with a particular directional preference are sys- 
tematically influenced by knowledge of their reviewer's preference in terms of their hy- 
pothesis generation (Peecher 1996) and evidence interpretation (Wilks 2002). Note that 
preparers in these studies do not form initial audit judgments or conclusions related to the 
task before knowing the reviewer's preference. 

In a departure from these prior studies, we investigate conditions in which preparers 
do explicitly state their task preferences (that is, their preferred audit conclusions on the 
task) before knowing the reviewers' task preferences. Specifically, our first factor— 
justification requirement—trefers to an accountability-inducing requirement (Tetlock 1985a, 
1992, 2002) where the preparer is required to justify his/her audit conclusion to a reviewer 
whose task preference is either similar to or dissimilar from the explicitly stated initial task 
preference of the preparer. Our motivation for looking at this factor can be viewed from 
two perspectives. First, an inference from prior research is that the costs associated with 
real-time review arise in part because preparers now have fewer opportunities to form 
opinions that are dissimilar from that made by the reviewer (in that preparers form a task 
preference only after knowing the reviewer's task preference). From this perspective, it is 
important to know the effects when preparers are required to explicitly form their task 
preferences prior to the reviewer's task preference being communicated to them. Such a 
requirement naturally increases the likelihood that the preparers form task preferences that 
are dissimilar from that formed by the reviewer, and findings on the efficacy of this re- 
quirement would be useful to CPA firms considering firm policies or interventions that 
mitigate limitations of real-time reviews. Second, in practice and particularly during periods 
when reviewers face the practical constraints of tight reporting deadlines and involvement 


! Koonce et al. (1995) find no justification effects on audit procedures and processes (e.g., problem identification, 
information request) adopted by preparers. 
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in several concurrent audit assignments, reviewers likely communicate their conclusions or 
opinions only after preparers have collected evidence and documented their initial conclu- 
sions.” Thus, there are circumstances when preparers naturally have to form their task 
preferences before knowing the reviewers’ task preferences. For these reasons, we expand 
on the dimension of justification requirement, and consider the match or mismatch between 
the initial task preference of the reviewer and that explicitly stated by the preparer; that is, 
whether the justification requirement is to a reviewer with a task preference that is similar 
to or dissimilar from that of the preparer. Because preparers’ responses to these justification 
requirements are likely constrained by the skill sets they possess, we investigate the mod- 
erating effects of two personal attributes: technical knowledge and tacit managerial knowl- 
edge (specifically, knowledge on managing their superior’s impressions). 

We conducted an experiment in which audit preparers performed an ambiguous audit 
task with no one single appropriate solution (doubtful accounts task). In the experiment, 
preparers assessed the appropriateness of the client’s accounting treatment for the allowance 
for doubtful accounts, and explicitly stated their initial task preferences. They were then 
informed of their superior’s task preferences, and wrote memos to their superior justifying 
their final decisions. 

Results show that preparers attempt to justify their conclusions in one of three ways: 
(1) presentation of a greater number of arguments in support of versus against their doc- 
umented conclusion (net persuasive evidence); (2) consideration of a broad spectrum of 
perspectives (e.g., client-risk, internal control risk, GAAP, materiality, etc) (breadth of is- 
sues); and (3) using language to frame evidence so as to highlight evidence consistent with 
the documented conclusion and downplay evidence inconsistent with the documented con- 
clusion (evidence framing). We find that preparers are strategic in their use of these different 
forms of justifications, depending on the context and the personal attributes of the preparers: 
For the first two justification forms, technical and tacit managerial knowledge moderate the 
effects of justification requirements. Preparers document more (less) net persuasive evidence 
when they are required to justify to a reviewer with dissimilar (similar) task preferences, 
but only when preparers possess high tacit managerial and high technical knowledge. We 
find this same pattern of results with respect to the documentation of breadth of issues; in 
addition, we find that preparers with high tacit managerial but low technical knowledge 
also document greater breadth of issues when they are required to justify to a reviewer 
with similar task preferences. The first two forms (net persuasive evidence and breadth of 
issues) involve the listing of relevant evidence related to the auditing issue at hand. In 
contrast, the third form, evidence framing, relates to the use of language to frame pro and 
con evidence. We find that the determinants of and directional effects on evidence framing 
are different from the other two justification forms. In contrast to the other two justification 
forms, evidence framing is used to a greater extent only when high tacit managerial knowl- 
edge preparers are required to justify to a reviewer with similar (as opposed to dissimilar) 
task preferences. Further analysis suggests that these preparers appear to economize on the 
enumeration of pro versus con reasons by treating evidence framing as a substitute for net 
persuasive evidence when justifying to a similar-preference reviewer. We also provide ev- 
idence that all three forms of justifications have direct and positive effects on evaluators' 
assessments of work quality. 


? These observations were gathered from interviews with audit partners and managers from the Big 4 firms. They 
also noted that depending on the audit task and individual auditor, there would be variations in terms of when 
the reviewer's conclusion is communicated to the preparer (i.e., before or after the preparer has formed an initial 
conclusion). 
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Our study contributes to the accounting and psychology literature in a number of ways. 
Prior research in accounting (e.g., Peecher 1996; Wilks 2002) focuses on the construct, 
justification to a superior with known preferences, and shows that it systematically affects 
preparers’ cognition or judgment. We expand on this construct, and demonstrate that what 
matters to the preparer is not simply the reviewer’s task preference per se, but the match 
or mismatch between the reviewer’s task preference and the initial task preference explicitly 
stated by the preparer. A similar point can be made to the related prior psychology literature 
on bolstering and escalating commitment. This literature documents that when people have 
explicitly stated their initial decision, they have a heightened tendency to escalate commit- 
ment to and justify the correctness of that initial decision, even in the face of negative 
feedback (Staw 1976; Fox and Staw 1979; Caldwell and O’Reilly 1982; Brockner 1992). 
Our results suggest that the extent of bolstering or escalating commitment to an explicitly 
stated initial decision may depend on the match or mismatch between this explicitly stated 
initial decision and the preferences of the justifiee.? 

Prior research also suggests that knowledge of the reviewers’ task preferences leads to 
dysfunctional effects on preparers’ performance (Peecher 1996; Wilks 2002). Our study 
suggests that dysfunctional effects may not necessarily be the case. We provide evidence 
that dysfunctional effects may occur if the reviewers’ communicated task preferences are 
similar to that of the preparers, but not when these preferences are dissimilar. In fact, our 
results suggest that benefits may actually accrue in the latter case with respect to listing 
“hard” evidence related to pro versus con reasons and auditing/accounting issues. 

In addition, while the separate effects on auditors’ performance of technical knowledge 
and tacit managerial knowledge have been examined (Tan and Libby 1997), there is no 
empirical evidence on the interactive effects of these two variables, nor how they jointly 
interact with justification requirements.* The closest study we are aware of is by Tan and 
Kao (1999), who show that auditors’ accountability to a superior with unknown preferences 
interacts with technical knowledge. They do not examine the effect of tacit managerial 
knowledge. Our results imply that the effects of accountability and technical knowledge 
are more complex than what has been documented in the extant literature, and that these 
findings may be relatively incomplete without considering the moderating role of tacit 
managerial knowledge. 

Finally, relatively little research has been done on the determinants of justification 
memo quality. A recent tax study finds that various aspects of tax knowledge are associated 
with adaptive information search, which in turn is associated with the quality of justification 
memos (Magro 2005). However, her study does not examine the specific forms of justifi- 
cations that may be adopted by tax professionals in documenting research memos, nor does 
it examine the interactive effects of technical knowledge with tacit managerial knowledge 
and justification requirements. Our study extends the literature on justifications by docu- 
menting the kinds of justifications that preparers naturally engage in, and by providing 
evidence that the nature and extent of these justifications is a complex function of the 
attributes of the preparers and justification requirements. We also provide evidence that 
justification memos using different justification strategies influence memo quality, a finding 
that has important implications on auditors’ defense of their work quality. 


3 Much of the escalating commitment literature invariably focuses on dependent variables such as resource allo- 
cation assessments (see review by Brockner 1992), rather than justification strategies or justification quality 
related to technical issues. 

Tan and Libby (1997) focus on how auditors of different performance ranks (e.g., highly ranked manager versus 
mediocre manager) have different levels of technical knowledge or tacit managerial knowledge. They do not 
examine how these two variables interactively determine performance. 


> 
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The rest of the paper is organized as follows. We develop our hypotheses in Section 
П, describe the research method in Section III, report the results in Section IV, and conclude 
with a discussion of the implications and limitations of the paper in Section V. 


IL BACKGROUND AND HYPOTHESES DEVELOPMENT 
Justification Forms 


In formulating a justification, the justifier attempts to provide support for an expressed 
viewpoint. Put another way, the justifier wants to persuade the target audience that his/her 
viewpoint is valid. While justifications in auditing settings have been previously investigated 
(Koonce et al. 1995; Tan et al. 1997), there is little empirical evidence on the forms of 
justifications employed by workpaper preparers, or features that make a justification per- 
suasive in the eyes of the reviewer. 

Justifications can be viewed as a form of reasoned argument, such that a justification 
is more persuasive the more “reasoned” or logical its argument quality (Petty et al. 1983; 
Petty and Wegener 1991). For instance, following the jurisprudence model by Toulmin and 
colleagues (Toulmin 1958; Toulmin et al. 1984), a reasoned argument consists of a claim, 
along with (a) grounds, considerations, and backings; and (b) rebuttals that acknowledge 
the presence of certain facts/evidence that may invalidate or weaken the claim.? Toulmin 
et al.’s (1984) work suggests that an argument or justification is persuasive if the logical 
components identified above are present. Conditions (a) and (b) together suggest that 
listing both reasons supporting (pro) and refuting (con) the claim can be viewed as provid- 
ing reasoned argumentation. Indeed, the persuasion literature demonstrates that two- 
sided arguments (involving both pro and con arguments) are more persuasive than one-sided 
arguments (incorporating only pro arguments) in that the former approach reduces the 
likelihood of counter-argumentation and enhances message credibility (Pechmann 1992). 
However, if too many con arguments are included, then the persuasiveness of the message 
may be eroded (Toulmin et al. 1984; Golden and Alpert 1987). Hence, we expect reasoned 
arguments to have more pro than con evidence (net persuasive evidence). 

Justifications that carefully assess all considerations (pro and con reasons) related to 
the claim (as suggested by Toulmin et al. [1984]) also likely involve an assessment of a 
broad variety of issues from different angles. In an auditing context, auditors are trained to 
arrive at their conclusions after considering the implications of a variety of issues such as 
the client’s business risk, industry risk, businesses processes and controls, materiality, and 
GAAP (e.g., see Winograd et al. 2000). All things considered, the basis for the conclusion 
can be seen to be comprehensive and persuasive the more such issues are considered; we 
refer to this as breadth of issues. 

Other than the listing of evidence (e.g., net persuasive evidence and breadth of issues), 
argument presentation by way of language influences its persuasiveness (Hirschberg 1990). 
More recently, Areni (2003) draws upon the literature in logic, social psychology, jurispru- 
dence, and sociolinguistics to suggest that persuasive arguments may be viewed as those 
that have certain grammatical features in addition to structural features identified by 
Toulmin et al. (1984). Specifically, justifications may be framed by using language to down- 
play evidence that is inconsistent with one’s conclusion by countering it with related pieces 
of consistent evidence (Kamins and Assael 1987; Hirschberg 1990; Crowley and Hoyer 
1994; see also Rich et al. 1997). This form of framing has been found to influence auditors’ 


5 According to Toulmin et al. (1984), claims can have associated qualifiers (e.g., use of words like possibly, 
certainly, etc.) to reflect the strength of the claim. For example, in the case of some accounting issues such as 
contingent liabilities, the use of probability phrases is prescribed by authoritative guidance. 
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judgments (Tan and Yip-Ow 2001). For example, consider the following sentence justifying 
the need for additional allowance for doubtful accounts: "notwithstanding the use of ex- 
perienced credit agents, companies with lower credit ratings are more likely to go bust." A 
few features of this justification should be noted. First, there is strategic placement of the 
evidence. The merit of the evidence related to the use of experienced credit agents (incon- 
sistent with the documented conclusion) is downplayed by juxtaposing it with a piece of 
evidence that is consistent with the desired conclusion (greater allowance for doubtful 
accounts because accounts receivable may become uncollectible). Second, there is the use 
of semantics in that the word "notwithstanding" serves to link the inconsistent and con- 
sistent evidence, and to downplay the merits of the inconsistent evidence. 

Overall, we examine three forms of justifications that may be used by preparers to 
enhance the persuasiveness of their justifications: net persuasive evidence, breadth of issues, 
and evidence framing. We discuss determinants of their usage below. 


Determinants of Justification Form Usage 


An extensive body of literature shows that accountability, the need to justify decisions 
to an audience, generally leads to greater effort and vigilance (Tetlock 1985a, 1992, 2002). 
Prior accounting studies show that accountability to a superior with unknown preferences 
has debiasing properties and leads to improved performance (Kennedy 1993; Tan and Kao 
1999). Research also shows that the need to justify decisions to a superior with known 
preferences leads auditors to align their judgments and interpret evidence in a manner 
consistent with the superior's preferences (Peecher 1996; Turner 2001; Wilks 2002). How- 
ever, as we posit below, the effects of justification are likely dependent on the match or 
mismatch between the preparer's explicitly stated initial preference and the preference of 
the reviewer. 

The literature on escalating commitment documents a heightened tendency for individ- 
uals to justify the correctness of their explicitly stated initial decisions in the face of negative 
feedback, particularly when they feel personally responsible for the initial decision (Staw 
1976; Fox and Staw 1979; Caldwell and O'Reilly 1982; Brockner 1992). Social psychology 
research also shows that individuals are more (less) likely to engage in deep cognitive 
processing when they are exposed to information that is inconsistent (as opposed to con- 
sistent) with their initial beliefs (Pyszczynski and Greenberg 1981; Wong and Weiner 1981; 
Hastie 1984). Individuals exposed to information that is inconsistent with their initial beliefs 
engage in greater cognitive processing of information to either "explain away” the infor- 
mation or to change their beliefs (Pyszczynski and Greenberg 1981; Wong and Weiner 
1981; Hastie 1984). This cognitive effect is likely stronger when the challenge to one's 
initial belief comes from a superior or evaluator (e.g., а workpaper preparer's reviewer; 
Tetlock 1985a, 1985b). Moreover, in the auditing context, because the preparer's justifica- 
tion requirement is to a superior who evaluates his/her documented work, we anticipate 
greater effort to be exerted in the preparation of the justification memo. 


a 


Rich et al. (1997) also identify other strategies such as selective failure to document evidence and evidence 
fabrication. These strategies involve some potentially unethical behavior of preparers, and fall outside the frame- 
work here (based on reasoned argument and evidence framing). Rich et al. (1997) also identify evidence order 
ав a persuasion strategy. However, because it is not clear whether any particular sequence (all pro then con 
reasons, ог vice versa; interweaving pro and con reasons) is deemed to be more persuasive, we do not posit 
evidence order as a specific justification strategy. We content-analyzed the justifications from participating au- 
ditors in our study, and find no evidence that participants order the pro and con reasons in any systematic way. 
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The direction and manner with which this increased effort is exerted toward docu- 
menting persuasive justifications is likely to vary by the auditor’s tacit managerial knowl- 
edge (henceforth, TMK), which refers to job knowledge related to managing self, others, 
and career (Wagner and Sternberg 1985). TMK has been associated with superior perform- 
ance evaluations within the CPA firm (Tan and Libby 1997). The aspect of TMK most 
relevant here relates to knowledge about managing others, specifically, with respect to 
managing a superior’s impressions of one’s work quality. Preparers with greater TMK are 
likely to be more cognizant of the importance of managing a superior’s impressions, and 
to be more aware of the different justification forms and their efficacies in persuading the 
reviewer that the documented conclusion and associated justifications are carefully thought 
out. 

In particular, preparers with greater TMK are likely more aware of the greater need to 
use even more persuasive justifications (through using the justification forms described 
above) when the reviewer’s preferences are dissimilar than when the preferences are sim- 
ilar.” This is because, in the former instance, if they choose to remain with their explicitly 
stated initial position despite the opposing view held by the reviewer, then these preparers 
are likely to anticipate that the reviewer would want to know why their position is more 
justifiable than that of the reviewer.* Even if the preparers were to adopt the position favored 
by the reviewer, high-TMK preparers are likely more cognizant of the need to invest more 
effort to employ the different justification forms to elaborate on why they changed from 
their initially explicitly stated task preferences—this is to impress upon the reviewer that 
they changed opinions through a reasoned process, and not to merely comply with the 
reviewer’s task preferences.? On the other hand, in the case where the preparer-reviewer 
task preferences are similar, preparers may feel a smaller need to do so because their initial 
(and final) conclusion is favored by the reviewer. 

However, we expect technical knowledge (henceforth, TK) to constrain the extent to 
which these high-TMK preparers are able to successfully employ more of these three jus- 
tification forms (net persuasive evidence, breadth of issues, and evidence framing) when 
the preparer-reviewer preferences are dissimilar. Technical knowledge is needed to employ 
each of these justification forms (Libby and Luft 1993). Compared to preparers with higher 
TK, those with lower levels of TK may not be able to successfully use these justification 
forms even if they wanted to because they may not have the requisite TK to generate the 
pro and con reasons (for net persuasive evidence and evidence framing), and auditing- and 
client-related issues (breadth of issues). 


7 The argument here is that low TMK auditors have less but not no knowledge about these issues. 

* Audit preparers may choose to remain with their initially expressed preference because they are convinced that 
it is the appropriate conclusion. They may also do so as an impression management technique in that superiors 
may value subordinates who are capable of defending a chosen outcome more than those who merely agree 
with what the reviewer prefers. In our study, we asked a Panel of 12 audit partners to assess the importance of 
various forms of actions and behavior on subordinates' career advancement prospects. They rated the ability of 
the subordinates to promote their ideas and convince superiors on the worth of their work (mean rating = 7.58, 
on a scale of 1 — extremely unimportant to 9 — extremely important) as more important than preparers either: 
(a) aligning with superior's convictions of how things need to be done even if it contradicts their own convictions 
(mean — 5.08, p — 0.009); (b) adjusting workpapers to suit reviewer styles and preferences (mean rating — 
5.42, p = 0.001); and (c) being receptive to superior's preferences (mean = 6.08, p = 0.010, all two-tailed 
tests). 

? Preparers who switch task preferences to be aligned with their reviewer's preferences may also be less likely to 
exert less effort to justify their documented conclusion (in that they know that their final task preferences are 
favored by the reviewer). However, this proclivity is likely mitigated by their incentive to manage their reputation 
by demonstrating that they switched preferences for good reasons (and not because they merely followed what 
the reviewer suggested). 
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In summary, we predict that preparers with dissimilar preparer-reviewer preferences are 
more likely to use the three justification forms (net persuasive evidence, breadth of issues, 
and evidence framing), and that this effect is largest for those with high TMK and high 
TK. This leads us to the following hypotheses. 


Hila: Preparers will document more net persuasive evidence (pro less con reasons) to 
justify their documented conclusions when their initial preferred conclusions are 
dissimilar from (versus similar to) those of the reviewers, with the difference 
being higher for preparers with higher levels of TMK and TK. 


Hib: Preparers will document more breadth of issues to justify their documented con- 
clusions when their initial preferred conclusions are dissimilar from (versus sim- 
ilar to) those of the reviewers, with the difference being higher for preparers with 
higher levels of TMK and TK. 


Hic: Preparers will engage in more evidence framing to justify their documented con- 
clusions when their initial preferred conclusions are dissimilar from (versus sim- 
ilar to) those of the reviewers, with the difference being higher for preparers with 
higher levels of TMK and TK. 


Ш. RESEARCH METHOD 
Participants 
One hundred fifteen auditors from two Big 4 firms in Singapore participated in this 
study. They comprised 60 first-year audit staff and 55 third-year audit seniors. 


Independent Variables 

The study employs three independent variables: justification requirement (henceforth, 
JR) (requirement to justify their conclusion to a reviewer with similar or dissimilar task 
preferences), technical knowledge (TK), and tacit managerial knowledge (TMK). The first 
variable, JR, is a between-subjects variable manipulated at two levels. Participants are re- 
quired to justify their audit conclusion (formed after knowing the reviewer's task preference) 
to a reviewer whose initial task preference (i.e., preferred conclusion) is either similar to 
or dissimilar from that of the participants (that is, their explicitly stated initial conclusion 
before knowing the reviewer's preference). 

The second independent variable, TK, is a measured variable. The TK test contains 
five multiple choice questions, and an open-ended question requiring participants to briefly 
explain how an auditor would assess the adequacy of specific allowance for doubtful ac- 
counts. One of the multiple-choice questions pertains to factors to consider in evaluating 
accounting estimates, another pertains to analytical procedures related to the accounts re- 
ceivable balance, while the remaining three questions pertain to accounting issues related 
to the sales and collections cycle of manufacturing clients. The questions are adapted from 
past CPA exams. With respect to the multiple-choice questions, TK is measured in terms 
of the number of correct responses obtained. The open-ended question is given the same 
weight as each of the multiple-choice questions. The response is considered correct if the 
participant provides a reasonable explanation for assessing the adequacy of allowance for 
doubtful accounts, and incorrect otherwise. 

The third variable, TMK is measured using two-work related audit scenarios. These 
are developed by reference to Wagner and Sternberg (1985) and Tan and Libby (1997). 
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Eight options are presented under each scenario, and the rating for each option is on a nine- 
point scale (1 = extremely unimportant to 9 = extremely important). For the first scenario 
relating to managing the superior’s impressions, participants are required to rate the im- 
portance of learning the preferences and styles of the superior, and being receptive to the 
superior’s work styles and documentation preferences. For the second scenario relating to 
ways to progress up the firm’s hierarchy, participants are required to rate the importance 
of strategies such as providing feedback to the superior on one’s work progress, seeking 
feedback from the superior, socializing with the superior in non-work-related activities, 
drawing awareness to one’s achievements, and promoting one’s ideas and work. 

Twelve partners (mean experience = 18 years) from two of the participating firms 
completed the TMK instrument. The partners’ responses serve as the benchmark against 
which the preparers’ TMK is measured. TMK is measured as the sum of the squared 
deviations of a participant’s ratings from the mean ratings of the Panel of partners. Sub- 
sequently, the squared deviation score of each participant is subtracted from the participant 
who obtained the highest score (that is, the worst score, 171.97), such that the higher the 
score, the better the participant’s TMK. 


Task 


The doubtful accounts task is adapted from Hackenbrack and Nelson (1996) and Tan 
and Jamal (2001). It is set in the context of Innovate, a company that has been the client 
of the audit firm for the past five years. The company specializes in producing sound cards 
for personal computers. During the fourth quarter of the past year, the company adopts a 
more liberal credit policy in an attempt to meet its sales target. In order to manage the risk 
from its new credit policy, the company hires two experienced “credit agents” to monitor 
the collectability of the accounts. Hence, the financial controller feels that no additional 
allowance for doubtful accounts is needed. However, he is willing to provide a detailed 
footnote on the above matter. Currently, the company is in need of additional financing to . 
meet its operational needs, and the bank requests an audited set of financial statements to 
discuss the terms of the loan. The case materials include the profit and loss account with 
details on sales, cost of sales, gross margin, bad debt expenses and other expenses for the 
current and prior years, and the debtors' aging analysis for the current year. Given this 
background, participants are to make an assessment on whether to permit a footnote dis- 
closure or to ask for additional allowance for doubtful accounts. 


Procedure 


The experiment consisted of two stages, and was administered during in-house training 
sessions conducted by the participating firms, in the presence of a training manager. To 
enable us to track the participants across the two stages, and to induce accountability and 
motivation, we requested the participants to write down their names and contact numbers 
on the research instrument. In Stage 1, the participants were presented with two sets of 
material placed in separate sealed envelopes to ensure that the participants performed the 
task in the intended sequence. The first envelope contained the task materials. Participants 
first read the case details, assessed their agreement with the client's accounting treatment 


19 To ensure that the items selected for inclusion in the tacit managerial knowledge measure have substantial 
ape among the partners with regard to their level of importance, the partners’ responses to the 16 items 
first evaluated, and only items that have a range of 5 or less are selected. This results in the selection of ten 
йеп. These items have, on average, lower standard deviation (1.19) and skewness (—0.80), than the remaining 

6 items (s.d. = 2.07, skewness = —1.39). This approach is similar to that adopted in Tan and Libby (1997). 
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on a scale of 1 (definitely insist on additional allowance) to 11 (definitely agree with 
footnote disclosure), made a decision on whether to insist on additional allowance or permit 
a footnote disclosure, and then indicated their confidence in the decision.!! After that, the 
participants proceeded to the second envelope containing some problem-solving ability 
questions (based on Tan and Као [1999]) and background information questions.? 

Stage 2 of the experiment was: conducted on a subsequent day of the participants’ 
training. In Stage 2, the participants were presented with two sets of materials placed in 
separate sealed envelopes. The cover sheet on the first envelope informed the participants 
that an audit manager had agreed to participate in the exercise by acting as the audit 
manager in charge of the audit of Innovate. They were then informed of the need to write 
a memo to the manager justifying their decision on the accounting treatment to be used by 
the client. Finally, the instructions stated that, as in an audit review, an audit manager would 
review the memos. The first envelope contained the task with the reviewer's task preferences 
(i.e., whether the reviewer agreed or disagreed with the client's accounting treatment) in- 
dicated on a separate sheet. The instructions stated that the reviewer had briefly reviewed 
the case details and made a preliminary judgment. The participants first read the reviewer's 
task preferences, re-assessed the appropriateness of the client's accounting treatment with 
the knowledge of the reviewer's task preferences (on the same scale as that used in Stage 
1), and indicated their decisions on the appropriateness of the client's accounting treatment 
together with their confidence in the decision. Subsequently, they wrote a memo to the 
reviewer on a separate sheet justifying their decision. The case materials (without the par- 
ticipants' responses to Stage 1) were included in the envelope for their reference. Finally, 
participants proceeded to another envelope that contained the manipulation check, TMK, 
and TK questions. 


Dependent Variables 


The dependent variables for Hla, H1b, and Hlc are the specific forms of preparers’ 
justifications. The first form is net persuasive evidence (Hla), which we define as the 
number of pro minus con reasons related to a documented conclusion. The second form, 
breadth of issues (H1b), relates to the issues covered in the preparers’ justifications, and is 
measured by computing the sum of different auditing and accounting issues (e.g., internal 
control assessment, client related factors, financial statement analysis, industry-related fac- 
tors, GAAP, and materiality) mentioned in the preparers’ justification memos. We elicited 
these issues from an audit manager and two colleagues with auditing experience during a 
pre-test. The third form is evidence framing (H1c), which we identify by observing whether 
evidence is framed toward the preparers’ conclusions (that is, by using semantics to high- 
light evidence consistent with their conclusions and downplay evidence inconsistent with 
their conclusions).'? The Appendix provides examples of the various forms of justification. 


11 The participants also performed ап additional ambiguous audit task of higher complexity related to hedging of 
foreign currency. Task order was counterbalanced, and order does not moderate our findings (p > 0.340). For 
this higher complexity task, participants had to assess whether it was appropriate for a holding company to 
hedge on behalf of its wholly owned subsidiary and, hence, reflect the exchange differences arising on the 
foreign currency liability in its foreign exchange reserves account. Participants found this task very complex, 
and wrote very few justifications. For example, only four participants engaged in evidence framing, and, in the 
dissimilar preparer-reviewer preference condition, only ten participants wrote any form of justification. This 
resulted in cell sizes that are too small for meaningful statistical tests on the effects of JR, TK, and TMK. 

12 Inferences relating to the hypotheses are unchanged with the inclusion of problem-solving ability as a control 
variable. 

13 Among those who used evidence framing, other than one participant, all engaged in only a single instance of 
evidence framing. Hence, we coded evidence framing as 1 if there was at least one instance of evidence framing, 
and 0 otherwise. 
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One of the authors and an independent coder with three years of auditing experience 
content-coded the justifications documented in the participants’ memos. The coders were 
blind to the experimental treatments (that is, the reviewer’s task preferences). The attributes 
coded are the number of pro and con reasons related to a documented conclusion, the audit 
and accounting issues identified (e.g., internal control assessment, client-related factors, 
financial statement analysis, industry-related factors, GAAP, and materiality), and the use 
of evidence framing. In coding the justification forms, disagreements between the two 
coders were resolved through discussion. The inter-rater agreement (Kappa coefficient) is 
high: 0.886 (pro and con statements), 0.930 (breadth of issues), and 0.862 (evidence fram- 
ing, p = 0.000 in all instances). 


IV. RESULTS 
Preliminary Analyses and Manipulation Checks 

As a manipulation check, we ask participants whether the judgment of the audit man- 
ager agreed or disagreed with the client. The majority correctly answered this question, 
and the responses of ten participants who failed the manipulation check are excluded from 
the analyses. Hence, the final sample consists of 105 usable геѕропѕеѕ.!* We also ask 
participants to indicate the extent to which they expect the memos to be reviewed by an 
audit manager (1 = strongly disagree; 9 = strongly agree). The mean rating of 7.33 suggests 
that, on average, participants expect the memos to be subsequently reviewed by an audit 
manager. 
` The mean TK score is 3.66 (varying from 1 to 6), and the mean TMK score is 139.83 
(varying from 0 to 166.36). Cronbach's alpha for the TMK questions is 0.79.5 The two 
independent variables are not significantly correlated (Pearson r = —0.115, p = 0.245, two- 
tailed test). 

To obtain a proxy for effort, we asked participants to indicate the time they started and 
ended the task. Our theory predicts that those who are in the dissimilar-preference condition 
are more likely to exert greater effort than those in the similar-preference condition. Con- 
sistent with our expectations, we find that the dissimilar-preference group spent more time 
writing the memo (mean = 8.74 minutes) than the similar-preference group (mean = 7.70 
minutes, p = 0.088, one-tailed test). 

We aggregate the analyses across firms as the results are not moderated by firm type 
(p > 0.606).!5 


Descriptive Statistics on Forms of Justification 

Panels A and B in Table 1 present the descriptive statistics on the different justification 
forms that preparers use to justify their documented conclusion to a reviewer with either 
similar or dissimilar task preferences. The net persuasive evidence (pro less con arguments) 
ranges from a low of —1 to a high of 6, with an average of 1.63 (1.77) for preparers 
required to justify to a reviewer with similar (dissimilar) task preferences. The breadth of 
issues ranges from a low of 0 to a high of 6 issues, with an average of 2.17 (2.15) issues 


М Inferences relating to the hypotheses are unchanged with the inclusion of the responses of participants who 
failed the manipulation check. 
15 Due to the multidimensional nature of the technical knowledge construct, we do not report any measure of 
reliability (see Bonner and Walker 1994). 
16 48 (45.7 percent) of the participants favor the sufficiency of footnote disclosure, while 57 participants (54.3 
' percent) favor the need for additional allowance in their initial decisions. Decision type (footnote versus allow- 
ance) does not moderate our findings (p > 0.229). 
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TABLE 1 
Descriptive Statistics on Forms of Justification 
(Net Persuasive Evidence, Breadth of Issues, Evidence Framing) 


Standard 
Minimum Maximum Mean Deviation 


Panel A: Forms of Justification Used by Preparers Required to Justify Their Documented 
Conclusion to a Reviewer with Similar Task Preferences (п = 52) 


Net persuasive evidence -1 6.00 1.63 1.21 
Breadth of issues 0 6.00 2.17 0.98 
Evidence framing 0 1.00 0.44 0.50 


Panel B: Forms of Justification Used by Preparers Required to Justify Their Documented 
Conclusion to a Reviewer with Dissimilar Task Preferences (n = 53) 


Net persuasive evidence -1 4.00 1.77 1.17 
Breadth of issues 0 6.00 2.15 1.03 
Evidence framing 0 1.00 0.40 0.49 


Net persuasive evidence refers to the number of pro arguments less the number of con arguments related to a 
documented conclusion. Breadth of issues is the sum of accounting and auditing issues the participants consider 
in justifying their conclusions. Evidence framing refers to whether participants frame their evidence to highlight 
evidence consistent with the documented conclusion and downplay evidence inconsistent with the documented 
conclusion. 


“for preparers required to justify to a reviewer with similar (dissimilar) task preferences.!" 
Finally, on average, 44 (40) percent of the participants engage in evidence framing when 
they are required to justify to a reviewer with similar (dissimilar) task preferences. 


Test of Hypotheses 


To test Hla, Hib, and Hic, separate regression analyses are run for each of the three 
justification forms, with JR, TMK, TK, as well as their interaction terms as independent 
variables. JR is coded as 0 when preparer-reviewer initial task preferences are similar, and 
1 when dissimilar. Both TMK and TK are centered on their means to avoid multicollinearity 
with their interaction terms (Aiken and West 1991). Results of these regression analyses 
are shown in Table 2, Panels A to C. 

Hypotheses 1а, 1b, and 1с predict that preparers will use more net persuasive evidence, 
breadth of issues, and evidence framing, respectively, when their initial preferred conclu- 
sions are dissimilar from (versus similar to) those of the reviewers, with the difference 
being higher for preparers with higher levels of TMK and TK. Table 2, Panel A, shows 
that the JR х TMK X TK term for net persuasive evidence is marginally significant (р 
= 0.094, one-tailed test), providing weak evidence of a three-way interaction. 

To assess the nature of the interaction, we follow the procedure suggested in Aiken 
and West (1991), and test simple effects of JR at high (one standard deviation above mean) 
and low (one standard deviation below mean) levels of TMK and TK. As seen from Table 
2, Panel D, when both TMK and TK are high, there is a significantly positive effect of JR 
on net persuasive evidence (p = 0.014, one-tailed test). The positively significant slope 
suggests that preparers document more pro arguments than con arguments when they are 


17 Participants generally identified issues relating to internal control risk (77.1 percent), trends/ fluctuations in the 
key accounting numbers (56.2 percent), GAAP (23.8 percent), and materiality (22.9 percent). 
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Regression Results of the Effects of Justification Requirement, Preparers’ Technical 
Knowledge and Tacit Managerial Knowledge on Forms of Workpaper Justifications 
(Net Persuasive Evidence, Breadth of Issues, and Evidence Framing) 


(n = 105) 


Panel A: Net Persuasive Evidence as Dependent Variable 


Regression 

Coefficient 
Intercept 1.614 
JR 0.182 
TK —0.011 
TMK —0.008 
JR x TK 0.292 
JR х TMK 0.009 
TK x TMK 0.000 
JR x TK X TMK 6.008 


Panel B: Breadth of Issues as Dependent Variable 


Standard 
Error . 


Regression 

Coefficient 
Intercept 2.161 
JR —0.003 
TK —0.265 
TMK 0.003 
JR x TK 0.532 
JR X TMK 0.000 
TK x TMK —0.004 
JR x TK x TMK 0.008 


Standard 
Error 


0.162 
0.229 
0.143 
0.005 
0.186 
0.009 
0.004 
0.006 


0.135 
0.191 
0.119 
0.005 
0.155 
0.008 
0.003 
0.005 


Panel C: Evidence Framing as Dependent Variable 


on 


Coefficient 
Intercept —0.367 
JR —0.072 
TK —0.095 
TMK 0.044 
JR x TK 0.127 
JR x TMK —0.043 
TK x TMK —0.012 
JR x TK x TMK 0.009 


Standard 
Error 
0.332 
0.439 
0.287 
0.018 
0.356 
0.023 
0.013 
0.015 








t p-value 
9.971 0.000 
0.796 0.428 

—0.079 0.937 
—1.490 0.139 
1.569 0.120 
0.922 0.359 
—0.106 0.916 

1.329 0.094* 

t p-value 
16.034 0.000 
—0.014 0.989 
-2.226 0.028 
0.669 0.505 
3.426 0.001 
-0.011 0.991 
—1.220 0.225 

1.766 0.041* 

Wald value 
1.225 0.268 
0.027 0.870 
0.110 0.740 
5.782 0.016 
0.126 0.722 
3.420 0.064 
0.912 0.340 

0.369 0.272* 


(continued on next page) 
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TABLE 2 (Continued) 


Panel D: Tests of Simple Slopes of Justification Requirement (JR) at Different Levels of 
Preparers’ Technical Knowledge (TK) and Tacit Managerial Knowledge (ТМК) 














Net Persuasive Evidence as Breadth of Issues as Dependent 
Dependent Variable Variable 
Magnitude Magnitude 
of Simple 54. of Simple Std. 
Slope Error t -value Slope Error t p-value 
High TK and 1.059 0.473 2.237  0.014* 0.961 0.394 2.438 00095 
High TMK 
High TK and 0.043 0.422 0.103 0.918 0.376 0.352 1.069 0.288 
Low TMK 
Low TK and —0212 0.465  —0.456 0.650 —0.970 0.387 -2507 0.014 
High TMK 
Low TK and —0.161 0.496 —0.325 0.746 —0.376 0.413 —0.911 0.365 
Low TMK 
* One-tailed test. 


JR = requirement to justify to a reviewer with similar or dissimilar task preferences (0 = similar task 
preferences, 1 = dissimilar task preferences); ‘ 

TK = technical knowledge (measured as the number of correct responses scored by the participants on the 
knowledge test; High TK is 1 standard deviation above the mean TK, and Low TK is 1 standard 
deviation below the mean TK); and 

TMK = tacit managerial knowledge (measured as the sum-of-squared deviations of participants’ ratings from the 
mean ratings of the Panel of partners on the TMK questions. Each participant’s TMK score was then 
subtracted from the participant with the highest score (171.97, that is the worst score). Hence, the 
higher the TMK score the better is the participants’ TMK; High TMK is 1 standard deviation above the 
mean TMK, and Low TMK is 1 standard deviation below the mean TMK). 

Both TK and TMK are centered on their means to avoid multicollinearity with their interaction terms. 





required to justify to a reviewer with dissimilar task preferences than when they are required 
to justify to a reviewer with similar task preferences. There is no statistically significant 
effect of JR when either TK, TMK, or both TK and TMK are low (p > 0.650, two-tailed 
test). These results are consistent with H]a.!? 

Table 2, Panel B, shows a significant three-way interaction of JR, TMK, and TK on 
the breadth of issues (p — 0.041, one-tailed test). As seen from Table 2, Panel D, the results 
of the tests of simple effects of JR are largely similar to that observed earlier. The results 
show a statistically significant positive effect of JR when both TMK and TK are high (p 
— 0.009, one-tailed test), but not when TK is high and TMK low, or when both TK and 
TMK are low (p > 0.288, two-tailed test). However, contrary to expectation, when preparers 
possess low TK and high TMK, there is a significantly negative effect of JR on the breadth 
of issues documented (p — 0.014, two-tailed test). Preparers with low TK and high TMK 
document more issues when they are required to justify to a reviewer with similar task 


18 We separately analyze the pro and con reasons, and find a three-way interaction for pro reasons (p = 0.042). 
For con reasons, the only statistically significant effect we find is a positive main effect of TMK (p = 0.040). 
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preferences than when they are required to justify to a reviewer with dissimilar task pref- 
erences.? These results are partially consistent with НІ. 

Table 2, Panel C, presents the results of the logistic regression analysis on the presence 
or absence of evidence framing. Contrary to expectations, we find no statistically significant 
three-way interaction effect of JR х TMK х TK (p = 0.272, one-tailed test). Instead, we 
find a marginally significant two-way JR х TMK interaction (p = 0.064, two-tailed test). 
Further analysis reveals a marginally significant negative effect of JR when TMK is high 
(p = 0.069, two-tailed test) but not when TMK is low (p = 0.219, two-tailed test). High- 
TMK preparers are more likely to use evidence framing when they are required to justify 
to a reviewer with similar task preferences than when they are required to justify to a 
reviewer with dissimilar task preferences. TK does not moderate this effect, nor does it 
interact with any other variable. The absence of a TK effect may be related to the fact that 
among those who engaged in evidence framing, all but one engaged in only a single instance 
of evidence framing (i.e., involving only one pair of pro and con reasons, but not more 
than that). Thus, the level of TK required to generate the single pair of pro and con reason 
may not be high, and most participants likely have the requisite baseline knowledge. 

The directional effect of JR—high-TMK preparers use more evidence framing in a 
similar task preference JR condition than in a dissimilar task preference JR condition—is 
also opposite to what we anticipated and found for the other two justification forms. We 
speculate that one reason is that high-TMK preparers may view evidence framing as an 
efficient yet effective manner to convince a similar-preference reviewer on the quality of 
their justifications. 2! High-TMK preparers may be aware of the need to show a two-sided 
argument involving both pro and con reasons (Toulmin et al. 1984; Golden and Alpert 
1987), but may also be aware that with respect to a similar-preference reviewer, they may 
be able to economize on the extent to which they need to do so. Therefore, they conserve 
their effort toward efficiently using language to frame evidence that involves downplaying 
a single con reason through emphasizing a single pro reason (rather than engage in enu- 
meration of a range of net persuasive evidence and breadth of issues). Consistent with this 
“efficiency” perspective, recall that literally all preparers using evidence framing engaged 
only in one (not multiple) instance of evidence framing. Also, within the similar-preference 
group, the use of evidence framing is negatively associated with documentation of net 
persuasive evidence (Pearson г = —0.571, р = 0.000, two-tailed test). This is consistent 


19 Tt could be that these high-TMK preparers perceive that, given their low TK, they would be less able to generate 
sufficient breadth of issues needed to persuade a reviewer with dissimilar task preferences (who might require 
substantial breadth of issues to be persuaded). Thus, they focus their effort on a reviewer with similar task 
preferences. It may also be that TMK compensates for the lack of TK in this situation. 

20 In coding breadth of issues documented, different aspects of the same issue (e.g., financial statement analysis) 
are coded as one issue, even though these would have been considered separate pro or con reasons. To assess 
the sensitivity of our results to this coding, we re-analyze the results using the sum of pro and con reasons as 
dependent variable, and obtain similar results as those using breadth of issues as dependent variable. 

21 We find no significant difference in confidence between participants required to justify to a reviewer with similar 
preferences and those required to justify to a reviewer with dissimilar preferences (means — 75.87 and 75.40, 
respectively, p — 0.842, two-tailed test). Furthermore, inferences relating to the hypotheses remain unchanged 
with the inclusion of participants’ confidence in their decision as a control variable. 
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with the argument that evidence framing is viewed as a substitute for net persuasive evi- 
dence.” These findings suggest that preparers strategically use different justification forms 
in reaction to justification requirements, conditional on the skill sets they possess. Overall, 
our predictions are generally supported for two (net persuasive evidence and breadth of 
issues) out of the three forms of justifications. 


Effects of Maintaining versus Switching One’s Initial Task Preferences 

We also perform additional analyses on the 53 (out of 105) participants who were 
required to justify to a reviewer with dissimilar task preferences. Specifically, we investigate 
whether there are any differences in the nature and extent of justification forms documented 
by the participants who changed their decision to align with the reviewer’s expressed task 
preference (“‘switch” group; п = 15, 28 percent) and those who remained with their original 
decision (“no-switch” group; n = 38, 72 percent). As expected, we observe no significant 
differences between the two groups in terms of the use of net persuasive evidence (means 
‚ = 1.60 and 1.84 for the switch and no-switch groups, respectively, p = 0.503, two-tailed 
test) and breadth of issues (means — 2.20 and 2.13 for the switch and no-switch groups, 
respectively, p = 0.829, two-tailed test). However, we find that preparers who switch task 
preferences (and end up with final task preferences similar to that by the reviewer) have a 
y greater propensity to use evidence framing than those who do not switch (means 
= 0.60 and 0.32 for the switch and no-switch groups, respectively, p = 0.062, two-tailed 
test). This result is somewhat consistent with our finding in our main analyses indicating 
that evidence framing is more likely used when preparer-reviewer task preferences are 
similar. It is also possible that evidence framing is used as a means for these preparers to 
acknowledge ''correction" of their initial task preference by highlighting how a pro reason 

(in favor of their revised decision) overrode initial concerns about a con reason.? 


Effects of Justification Forms on Assessments of Memo Quality 


Does greater. documentation of each of the three justification forms lead to better as- 
sessments of memo quality? Presumably, the answer is “yes” because this is likely why 
different preparers engage in differential justification efforts. Auditor respondents in Emby 
and Gibbins’ (1988) survey do perceive that justifiability of a decision is a characteristic 
of good judgment. 

To address this question, we asked two evaluators to assess the quality of the justifi- 
cation memos written by our participants. The evaluators comprised one audit manager with 
11 years of audit experience (including experience in the audit training department), and 


22 [n contrast, within the dissimilar-preference group, there is no statistically significant association between the 
use of net persuasive evidence and evidence framing (r = —0.208, р = 0.136, two-tailed test). Furthermore, the 
negative correlation between the above two justification forms (net persuasive evidence and evidence framing) 
is significantly higher in the similar-preference group than in the dissimilar-preference group (р = 0.05, Fisher 
r-to-Z transformation test). Across all participants, between net persuasive evidence and evidence d bd r 
= —0.393 (p = 0.000); between net persuasive evidence and breadth of issues, r = 0.479 (p = 0.000); and 
between evidence framing and breadth of issues, r = 0.249 (p = 0.010, all two-tailed). 

? Within the switch group, there is no statistically significant association between the use of evidence framing 
and net persuasive evidence (Pearson г = —0.302, р = 0.275, two-tailed test). Furthermore, within the dissimilar- 
preference group, there is no significant difference in confidence levels between participants who change deci- 
sions to align with the reviewer and those who remain with their initial decisions (means = 76.67 and 74.89, 
respectively, p = 0.655, two-tailed test). Finally, within the switch group, we find no main or interactive effects 
of TK or TMK (р > 0.450, two-tailed test). These tests may lack power because of the small sample size (n 
= 15). 
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one professor who was a former audit manager with ten years of audit experience. Eval- 
uators were required to rate the quality of each of the preparers’ justification memos on a 
scale of 0 (low quality) to 10 (excellent quality). To ensure that the memo quality is 
unaffected by the legibility of the preparers’ handwriting, the handwritten justification 
memos are transcribed before being assessed by the reviewers. 

Because the task is an ambiguous audit task for which audit quality is primarily de- 
termined by the justifiability and defensibility of the documented conclusion, the evaluators 
were given explicit instructions to consider the quality of the justifications documented by 
the preparers in support of their stated conclusion, and not base their evaluations on the 
appropriateness of the documented conclusion without evaluating the quality of the justi- 
fications. Both evaluators had different initial task preferences (one favored footnote dis- 
closure, and the other favored additional allowance). There is no significant difference in 
the mean ratings awarded by Evaluator A to the preparers’ memos with conclusions that 
were in agreement (mean = 5.46) and those that were in disagreement with the evaluator’s 
preference (mean = 5.10, p = 0.198, two-tailed test), suggesting no significant effect of 
conclusion congruence on these ratings. However, Evaluator B gave significantly higher 
ratings (p = 0.002, two-tailed test) to preparers’ memos with conclusions that disagreed 
with that of the evaluator (mean = 5.93) and lower ratings to memos with conclusions that 
agreed with the evaluator (mean = 4.73); this is directionally opposite to what is expected 
if the evaluator had merely rewarded conclusion congruence. Thus, the evaluator’s ratings 
appear to be influenced by the quality of the justifications rather than by conclusion appro- 
priateness or congruence. The scores given by the two evaluators are significantly positively 
correlated (Pearson r = 0.429, p = 0.000, two-tailed test). Paired t-tests also show no 
significant differences in the average ratings accorded by the two evaluators to each memo 
(p = 0.663, two-tailed test). We use the average of both evaluators’ ratings as dependent 
variable. 

Table 3, Panel A, reports the results of the regression of the three justification forms 
(independent variables) on evaluated performance (dependent variable). As expected, all 
three forms of justifications are positively associated with evaluated performance (net per- 
suasive evidence, р = 0.015; breadth of issues, р = 0.006; evidence framing, р = 0.001, 
all one-tailed). To assess whether the results hold after controlling for the effects of JR, 
TMK, and TK, we re-perform the analyses above with JR, TMK, and TK, along with their 
interactions, as control variables. As shown in Table 3, Panel B, all three forms of justifi- 
cation are again positively associated with evaluated performance (net persuasive evidence, 
р = 0.020; breadth of issues, р = 0.008; evidence framing, р = 0.001, all one-tailed).2^25 


24 Together, the results satisfy the following two conditions listed by Kenny et al. (1998) to establish that justifi- 
cation form mediates the effects of JR, TK, and TMK on evaluated performance: (a) JR, TK, and TMK affect 
justification form (which we find in the main analyses; Table 2, Panels A to C), and (b) in turn, the mediator 
(i.c., justification form) affects evaluated performance (Table 3, Panel B), even after controlling for the presence 
of JR, ТК, and TMK. For condition (b), if the three-way interaction effect (JR х TK х TMK) is statistically 
insignificant, a total mediation occurs. In our case, the interaction is marginally significant (p = 0.052, two- 
tailed test), which suggests that justification form partially mediates the effects of JR, TK, and TMK on evaluated 
performance. We also re-perform the regression analysis with evaluated performance as dependent variable, and 
JR, TMK, TK, and their interactions as independent variables (1.е., without the three justification forms as 
independent variables). As in our test of Hla, we find a statistically significant three-way interaction (p = 0.010, 
two-tailed test), and a statistically significant simple main effect of JR only when both TMK and TK are high 
(p = 0.015, two-tailed test), but not when either TK, TMK, or both TK and TMK are low (p > 0.207, two- 
tailed test). 

25 The variance inflation factor (VIF) for each of the three justification forms is below 2.2, which is less than the 
cut-off point of 10 that is indicative of multicollinearity problems (Kutner et al. 2004). 
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TABLE 3 
Evaluators’ Assessment of Preparers’ Task Performance (Memo Quality) 
(n = 105) 
Regression Standard 
Coefficient Error t-value p-value 


Panel A: Regression Results of the Effects of Forms of Justification on Evaluators’ Assessment 
of Preparers’ Task Performance (Memo Quality) 


Intercept 3.457 0.301 11.479 0.000 

Net persuasive evidence 0.318 0.143 2.220 0.015* 
Breadth of issues 0.415 0.161 2.580 0.006* 
Evidence framing 1.036 0.310 3.339 0.001* 


Panel B: Regression Results of the Effects of Forms of Justification on Evaluators’ Assessment 
of Preparers’ Task Performance (Memo Quality) (Controlling for the Effects of 
Justification Requirement, Preparers’ Technical Knowledge, and Tacit Managerial 


Knowledge) 

Intercept 3.255 0.336 9.694 . 0.000 

JR 0.461 0.238 1.939 0.055 

TK —0.072 0.153 —0.470 0.639 

TMK 0.000 0.006 - 0.029 0.977 

JR x TK —0.044 0.204 —0.218 0.828 

JR x TMK 0.002 0.010 0.178 0.859 

TK x TMK —0.005 0.004 —1.441 0.153 

JR x TK x TMK 0.012 0.006 1.969 0.052 

Net persuasive evidence 0.304 0.145, 2.089 0.020* 
Breadth of issues 0.409 0.166 2.456 0.008* 
Evidence framing 1.058 0.313 3.375 0.001* 
* One-tailed test. 


Preparers' task performance (memo quality) is assessed by two evaluators (a manager with a Big 4 accounting 
firm, and an academic with prior experience as an audit manager), on a scale of 0 (low quality) to 10 (excellent 
quality). The average of their ratings serves as the dependent variable. 


У. CONCLUSION 

We identify three ways by which preparers attempt to justify their documented conclu- 
sions. We provide the first empirical evidence that the form and extent of justifications vary 
by whether preparers are required to justify their explicitly stated conclusions to a reviewer 
with similar or dissimilar initial task preferences, and the preparers' tacit managerial knowl- 
edge and technical knowledge. We find that high tacit managerial knowledge preparers are 
strategic in terms of the circumstances under which they use particular forms of justifica- 
tions. They have а. greater tendency to use language persuasively in framing their evidence 
(1.е., evidence framing) when they are required to justify to reviewers with similar initial 
task preferences. In contrast, preparers with high tacit managerial knowledge and high 
technical knowledge have a tendency to articulate and enumeraie audit evidence (e.g., listing 
of pro versus con reasons; breadth of issues considered) to provide logical support for the 
stated conclusion when they are required to justify their conclusions to reviewers with 
dissimilar initial task preferences. Thus, the nature and extent of justifications used vary 


The Accounting Review, March 2006 


Determinants of Audit Preparers’ Workpaper Justifications 491 


by environmental factors (here, the preferences of the reviewer) and attributes of the рге- 
parer (here, the preferences of the preparer, and his/her tacit managerial knowledge and 
technical knowledge). 

Readers should note some limitations of this study. First, the justification forms that 
we investigate are likely a subset of the repertoire of justification approaches that preparers 
use. For example, consultation has been documented to be a means of improving the jus- 
tifiability of a decision (Kennedy et al. 1997), but this option was not available to our 
participants because discussion was not permitted in our experiment. Second, our partici- 
pants are staff auditors and seniors, and their justification approaches may differ from those 
of managers, who may have more sophisticated ways of dealing with reviewers’ preferences. 
Third, we consider a setting where the reviewer’s preferences influence the work done by 
preparers. It may well be that the reverse is true—the preparers’ work influences the re- 
viewer’s preferences, which in turn influences the work performed by preparers. Our study 
does not capture this possible interactive relation, which is an avenue for future research. 
Fourth, in evaluating the participants’ memos, evaluators were given explicit instructions to 
consider only the quality of the justifications. This quality measure we obtain may not 
generalize to what occurs in practice because without such instructions, evaluators’ ratings 
may also be influenced by the congruence of the documented conclusion with their preferred 
conclusion rather than by the justification’s quality. Finally, our current study uses Singapore 
auditors as participants, and the results may not generalize to U.S. auditors. This concern 
is mitigated by the fact that our participating auditors are from the Big 4 accounting firms, 
all of which adopt consistent global practices. Prior studies using Singaporean auditor par- 
ticipants have also documented results that are consistent with those using U.S. auditors 
(Tan and Libby 1997; Tan and Kao 1999). 

We conclude by highlighting some implications of our findings. One implication is that 
there are potential benefits that may accrue to the audit review process from having pre- 
parers form and explicitly state their task preferences before becoming aware of the re- 
viewer's task preferences, especially when preparer-reviewer task preferences are dissimilar. 
Furthermore, if the reviewer wants to motivate the preparer into thinking harder about net 
persuasive evidence and breadth of issues to support the documented conclusion, the re- 
viewer could, early in the audit process, play the role of “devil’s advocate" (Schwenk and 
Valacich 1994). The reviewer could first find out or anticipate the preparers' task prefer- 
ences, and then communicate an initial task preference that is opposite to that held by the 
preparer. Similarly, the reviewer could invoke a counter-argument mind-set in preparers by 
prodding them to consider potential alternatives (Kray and Galinsky 2003). However, for 
these mechanisms to be effective, preparers need to possess the requisite tacit managerial 
knowledge and technical knowledge. 

Our finding that the panel of evaluators (comprising a senior audit manager and a 
former audit manager) assessed justification memos using these three justification forms to 
be of higher quality has practical importance. Extant literature argues that decision justifi- 
ability is a characteristic of good audit judgment (Emby and Gibbins 1988), but the relation 
between justifications znd assessed work quality has hitherto not been established. To the 
extent that expert witnesses also assess justification memo quality in the same way, our 
results suggest that CPÀ firms should heed the manner with which justifications are written 
up, as well as their associated determinants. An important avenue for future research is to 
assess whether our findings related to how justification forms influence perceptions of memo 
quality apply to other groups such as expert witnesses and jurors. 


- 
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APPENDIX 
Examples of Forms of Justification 
(Net Persuasive Evidence, Breadth of Issues, and Evidence Framing) 


Panel A: Net Persuasive Evidence 
Preparers’ Conclusion: Require Additional Allowance for Doubtful Accounts 
Pro arguments: 


* Comparing the aging report of 1999 and 2000, 2000 sees an increase in the debts 
between 0—90 days. 


Con arguments: 

e Hired two experienced “credit agents” to monitor collectability. 
Preparers’ Conclusion: Footnote Disclosure is Sufficient 
Pro arguments: 

* Client has implemented additional controls to deal with the change in credit policy. 
Con arguments: 

* The credential of new debtors has decreased since they are smaller companies with 

lower credit ratings. Thus, risk of unrecoverability increases correspondingly. 

Panel B: Breadth of Issues 
Internal control and related risk assessment 


* Company hired two experienced "credit agents" whose task is to monitor accounts 
receivable collections 


Financial statement analysis: 


• Company's aging accounts receivable for 91-120 days and greater than 120 days 
has decreased (2 percent). 


Materiality 

e Sales made to new customers were not material to the total sales for the year. 
GAAP 

• Based on the principle of prudence, suggest additional allowance. 
Client-related factors: 


e Historically a good client for the last 5 years, hence, may have comfort with man- 
agement representation. 


Industry-related factors 

• Volatile nature of the industry. 
Panel C: Framing Evidence to be Consistent with the Conclusion 
Preparers’ Conclusion: Require Additional Allowance for Doubtful Accounts 


* The company is very confident about its new strategy and feels that no additional 
accrual is necessary and merely a footnote is enough. However, based on the 
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principle of prudence and the volatile nature of the industry, suggest additional 
allowance. 


Preparers’ Conclusion: Footnote Disclosure is Sufficient 


* Although management adopted a more liberal credit policy by selling to smaller 
companies with lower credit ratings, management hired experienced credit agents to 
monitor accounts receivable collections, and, as such, no additional provision is 
required. 
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Editorial Data 


The following table contains information about turnaround time for manuscripts (including revisions) 
on which editorial decisions were made in the 12-month period ended December 31, 2005. Turnaround 
time is the number of days between the date that the manuscript was received and the date of the 
editor's letter to the author(s). The number in parentheses reflects June-August 2005, under incoming 
Editor Dan Dhaliwal. 





Number of Cumulative Cumulative 
Manuscripts Percent Number Percent 
0 = Days = 30 84 18.06 84 18.06 
(11) 
31 = Days = 60 183 39.36 267 57.42 
(43) 
61 = Days = 90 140 30.11 407 87.53 
(26) 
91 = Рауз = 120 49 10.54 456 98.07 
(13) 
121 = Days 09 1.93 465 100.00 
_ 2 NE 
Total 465 100.00 
(95) 


The mean review time was 55.93; the median review time was 77 days. 


New Submissions by calendar year (January 1 to December 31) 


1992 241 
1993 234 
1994 231 
1995 195 
1996 230 
1997 195 
1998 196 
1999 239 
2000 260 
2001 260* 
2002 324 
2003 327 
2004 307 


2005 (1/1-8/31) 267** (8 months) 
ж 2001--іп addition to the 260 regular submissions, 68 MSs were submitted to the ТАК Quality of 
Earnings Conference. 
** Projects out to 396 new submissions for 2005. 


The acceptance rate (defined as number of MSs accepted divided by the number of new submissions) 
over the past 5 years is between 12-16% per year. 
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EDITORIAL POLICY AND STYLE INFORMATION 


EDITORIAL POLICY 


According to the policies set by the Publications Committee (which were endorsed by the Executive 
Committee and were published in the Accounting Education News, June 1987), The Accounting Review 
“should be viewed as the premier journal for publishing articles reporting the results of accounting research 
and explaining and illustrating related research methodology. The scope of acceptable articles should embrace 

“any research methodology and any accounting-related subject, as long as the articles meet the standards 
established for publication in the journal ... No special sections should be necessary. The primary, but not 
exclusive, audience should be—as it is now—academicians, graduate students, and others interested in ac- 
counting research.” 

The primary criterion for publication in The Accounting Review is the significance of the contribution 
an article makes to the literarure. 

The efficiency and effectiveness of the editorial review process is critically dependent upon the actions 
of both authors submitting papers and the reviewers. Authors accept the responsibility of preparing research 
papers at a level suitable for evaluation by independent reviewers. Such preparation, therefore, should include 
subjecting the manuscript to critique by colleagues and others and revising it accordingly prior to submission. 
The review process is not tc be used as a means of obtaining feedback at early stages of developing the 
research. 

Reviewers and associate editors are responsible for providing critically constructive and prompt evalu- 
ations of submitted research papers based on the significance of their contribution and on the rigor of analysis 
and presentation. Associate editors also make editorial recommendations to the editor. 


MANUSCRIPT PREPARATION AND STYLE 


The Accounting Review's manuscript preparation guidelines follow The Chicago Manual of Style (15th ed.; 
University of Chicago Press). Another helpful guide to usage and style is The Elements of Style, by William 
Strunk, Jr., and E. B. White (Macmillan). Spelling follows Webster's Collegiate Dictionary. 


FORMAT 


1. All manuscripts should be printed in 12-point font on one side of 89, х 11" good quality paper and be 
double-spaced, except for indented quotations. 

2. Manuscripts should be as concise as the subject and research method permit, generally not to exceed 
7,000 words. . 

3. Margins of at least one inch from top, bottom, and sides should facilitate editing and duplicatio 

4. To promote anonymous review, authors should not identify themselves directly or indirectly in their papers 
or in experimental test instruments included with the submission. Single authors should not use the 
editorial "we." | 

5. А cover page should show the title of the paper, the author’s name, title and affiliation, email address, 
any acknowledgments, and a footnote indicating whether the author would be willing to share the data 
(see last paragraph in this statement). 3 


Pagination: All pages, including tables, appendices and references, should be serially numbered. Major 
sections should be numbered in Roman numerals. Subsections should not be numbered. 


Numbers: Spell out numbers from one to ten, except when used in tables and lists, and when used with 
mathematical, statistical, scientific, or technical units and quantities, such as distances, weights and measures. 
For example: three days; 3 Kilometers; 30 years. All other numbers are expressed numerically. 
Percentages and Decimal Fractions: In nontechnical copy use the word percent in the text. 


Hyphens: Use a hyphen to join unit modifiers or to clarify usage. For example: а well-presented analysis; 
re-form. See Webster’s for correct usage. 


Keywords: The abstract is to be followed by four keywords that will assist in indexing the paper. 
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ABSTRACT/INTRODUCTION 


An Abstract of about 100 words should be presented on a separate page immediately preceding the text. 
The Abstract should concisely inform the reader of the manuscript’s topic, its methods, and its findings. 
Keywords and the Data Availability statements should follow the Abstract. The text of the paper should start 
with a section labeled “I. Introduction,” which provides more details about the paper’s purpose, motivation, 
methodology, and findings. Both the Abstract and the Introduction should be relatively nontechnical, yet 
clear enough for an informed reader to understand the manuscript’s contribution. The manuscript’s title, but 
neither the author’s name nor other identification designations, should appear on the Abstract page. 


TABLES AND FIGURES 


The author should note the following general requirements: 

1. Each table and figure (graphic) should appear on a separate page and should be placed at the end of the 
text. Hach should bear an Arabic number and a complete title indicating the exact contents of the table 
or figure. Tables and figures should define each variable. The titles and definitions should be sufficiently 
detailed to enable the reader to interpret the tables and figures without reference to the text. 

A reference to each graphic should be made in the text. 

. The author should indicate by marginal notation where each graphic should be inserted in the text. 
Graphics should be reasonably interpreted without reference to the text. 

Source lines and notes should be included as necessary. 


WA ыр 


Equations: Equations should be numbered in parentheses flush with the right-hand margin. 


DOCUMENTATION 


Citations: Work cited should use the "author-date system” keyed to a list of works in the reference list (see 
below). Authors should make an effort to include the relevant page numbers in the cited works. 


1. In the text, works are cited as follows: authors’ last name and date, without comma, in parentheses: for 
example, (Jones 1987); with two authors: (Jones and Freeman 1973); with more than two: (Jones et al. 
1985); with more than one source cited together (Jones 1987; Freeman 1986); with two or more works 
by one author: (Jones 1985, 1987). 

2. Unless confusion would result, do not use “р.” or “pp.” before page numbers: for example, (Jones 1987, 
115). 

3. When the reference list contains more than one work of an author published in the same year, the suffix 
a, b, etc. follows the date in the text citation: for example, (Iones 1987а) or (Jones 1987a; Freeman 
1985b). 

4. If an author's name is mentioned in the text, it need not be repeated in the citation; for example, “Jones 
(1987, 115) says ..." 

5. Citations to institutional works should use acronyms or short titles where practicable; for example, (AAA 
ASOBAT 1966); (AICPA Cohen Commission Report 1977). Where trief, the full title of an institutional 
work might be shown in a citation: for example, (ICAEW The Corporate Report 1975). 

6. If the manuscript refers to statutes, legal treatises or court cases, citations acceptable in law reviews 
should be used. 


Reference List: Every manuscript must include a list of references containing only those works cited. Each 
entry should contain all data necessary for unambiguous identification. With the author-date system, use the 
following format recommended by The Chicago Manual: 


1. Arrange citations in alphabetical order according to surname of the first author or the name of the 
institution responsible for the citation. 

Use author's initials instead of proper names. 

. Date of publication should be placed immediately after author's name. 

. Titles of journals should not be abbreviated. 

. Multiple works by the same author(s) in the same year are distinguished by letters after the date. 

. Inclusive page numbers are treated as recommended in Chicago Manual section 17.168.B. 


Sample entries are as follows: 


American Accounting Association, Committee on Concepts and Standards for External Financial Reports. 
1977. Statement on Accounting Theory and Theory Acceptance. Sarasota, FL: AAA. 


CN ta шо [У 
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Demski, J. S., and D. E. M. Sappington. 1989. Hierarchical structure and responsibility accounting. Journal 
of Accounting Research 27 (Spring): 40-58. 

Dye, R., B. Balachandran, and R. Magee. 1989. Contingent fees for audit firms. Working paper, Northwestern 
University, Evanston, IL. 

Fabozzi, Е, and 1. Pollack, eds. 1987. The Handbook of Fixed Income Securities. 2nd edition. Homewood, 
IL: Dow Jones-Irwin. 

Kahneman, D., P. Slovic, and A. Tversky, eds. 1982. Judgment Under Uncertainty: Heuristics and Biases. 
Cambridge, U.K.: Cambridge University Press. 

Porcano, T. M. 1984a. Distributive justice and tax policy. The Accounting Review 59 (October): 619-636. 

. 1984b. The perceived effects of tax policy on corporate investment intentions. The Journal of the 
American Taxation Association 6 (Fall): 7-19. 

Shaw, W. H. 1985. Empirical evidence on the market impact of the safe harbor leasing law. Ph.D. dissertation, 
The University of Texas at Austin. 

Sherman, T. M., ed. 1984. Conceptual Framework for Financial Accounting. Cambridge, MA: Harvard 
Business School. 





Footnotes: Footnotes are not used for documentation. Textual footnotes should be used only for extensions 
and useful excursions of information that if included in the body of the text might disrupt its continuity. 
Footnotes should be consecutively numbered throughout the manuscript with superscript Arabic numerals. 
Footnote text should be doubled-spaced and placed at the end of the article. 


SUBMISSION OF MANUSCRIPTS 
Authors should note the following guidelines for submitting manuscripts: 


1. Manuscripts currently under consideration by another journal or publisher should not be submitted. The 
author must state that the work is not submitted or published elsewhere. 

2. In the case of manuscripts reporting on field surveys or experiments, four copies of the instrument 
(questionnaire, case, interview plan, or the like) should be submitted. Information that might identify the 
author(s) must be deleted from the instrument. 

3. MS should be submitted via email as Microsoft® Word or PDF files to the ТАК office at TARUAZ@ 
email.arizona.edu. A submission letter, a separate cover page, and the MS containing no author identifi- 
cation should be emailed as separate files. The submission fee should be paid online (VISA or MasterCard 
only) at: https://aaahq.org/ AA Aforms/journals/tarsubmit.cfm. If unable to pay online, a check in U.S. 
funds, made payable to the American Accounting Association, may be sent to Professor Dan Dhaliwal, 
Senior Editor, The Accounting Review, The University of Arizona, Department of Accounting, McClelland 
Hall 301, 1130 E. Helen Street, Tucson, AZ 85721-0108. The submission fee in U.S. funds is $125.00 
for members and $200.00 for nonmembers of the AAA. Effective January 1990, the submission fee is 
nonrefundable. : 

. The author should retain a copy of the paper. 

. Revisions must be submitted within 12 months from request, otherwise they will be considered new 
submissions. 


іл E 


COMMENTS 


Comments on articles previously published in The Accounting Review will be reviewed (anonymously) by 
two reviewers in sequence. The first reviewer will be the author of the original article being subjected to 
critique. If substance permits, a suitably revised comment will be sent to a second reviewer to determine its 
publishability in The Accounting Review. If a comment is accepted for publication, the original author will 
be invited to reply. All other editorial requirements, as enumerated above, also apply to proposed comments. 


POLICY ON REPRODUCTION 


An objective of The Accounting Review is to promote the wide dissemination of the results of systematic 
scholarly inquiries into the broad field of accounting. 

Permission is hereby granted to reproduce any of the contents of the Review for use in courses of 
instruction, as long as the scurce and American Accounting Association copyright are indicated in any such 
reproductions. 
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Written application must be made to the American Accounting Association, 5717 Bessie Drive, Sarasota, 
FL 34233-2399, for permission to reproduce any of the contents of the Review for use in other than courses 
of instruction—e.g., inclusion in books of readings or in any other publications intended for general distri- 
bution. In consideration for the grant of permission by the Review in such instances, the applicant must 
notify the author(s) in writing of the intended use to be made of each reproduction. Normally, the Review 
will not assess a charge for the waiver of copyright. 

Except where otherwise noted in articles, the copyright interest has been transferred to the American 
Accounting Association. Where the author(s) has (have) not transferred the copyright to the Association, 
applicants must seek permission to reproduce (for all purposes) directly from the author(s). 


POLICY ON DATA AVAILABILITY 


The following policy has been adopted by the Executive Commitiee in its April 1989 meeting. “An 
objective of (The Accounting Review, Accounting Horizons, Issues in Accounting Education) is to provide 
the widest possible dissemination of knowledge based on systematic scholarly inquiries into accounting as 
a field of professional research, and educational activity. As part of this process, authors are encouraged to 
make their data available for use by others in extending or replicating results reported in their articles. 
` Authors of articles which report data dependent results should footnote the status of data availability and, 
when pertinent, this should be accompanied by information on how the data may be obtained." 
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MARCH 2006 PLACEMENT ADS 


The deadline for information to be included in the “Placement Information” section 
is two months prior to the date of publication. The Review is published in January, 
April, July, and October. Placement advertising and replies should be sent to the 
American Accounting Association, 5717 Bessie Drive, Sarasota, FL 34233-2399, 
ATTENTION: ADVERTISING DEPT. All replies to box numbers should be on sep- 
arate sheets to facilitate forwarding. 


AMERICAN UNIVERSITY OF BEIRUT, Olayan School of Business (OSB) is seeking appli- 
cations for a renewable three-year appointment in all accounting areas with a competitive salary 
and benefits. One or more positions are available at all three ranks. Successful applicants should 
possess an earned Doctorate by the starting date of September 15, 2006. Responsibilities include 
teaching, research, and services. The American University of Beirut (AUB), founded in 1866 and 
chartered in the State of New York, is the Middle East’s premier university. The 70-acre campus 
is situated in Ras Beirut peninsula next to the Mediterranean Sea. The OSB serves 1,400 students 
and offers B.B.A., M.B.A., and E.M.B.A. programs. OSB is currently seeking AACSB. AUB is 
committed to Equal Opportunity and Equity regardless of gender, race, and ethnic background. 
Contact: George K. Najjar, Dean, OSB-AUB, PO Box 11-0236, Beirut, Lebanon. Voice: (9611) 
352-700; Fax: (9611) 750-214; Email: gnajjar@aub.edu.lb. 


KONKUK UNIVERSITY, School of Business is seeking a full-time Visiting Professor for a period 
of one or two semesters (starting March 2006/September 2006/March 2007). Applicants should 
possess a Ph.D. in a related field of business. The essential function of the position is teaching 
two undergraduate courses per semester. If the professor stays for one semester, he/she only 
needs to teach 15 weeks with a full-time salary for 6 months. The same rule applies for one year, 
i.e., teaching only 30 weeks with a full salary for 12 months. Salary ranges between U.S.$55,000/ 
yr and $75,000/yr (negotiable). Extra benefits include Korean national medical insurance (half 
paid), free campus accommodation (utilities included) and moving expense support (up to $2,000). 
Contact: Thomas Joh, Dean of Business School, Konkuk University, School of Business, 
Hwayang-dong 1, Kwangjin-ku, Seoul, Korea 143-701; Email: thjoh@konkuk.ac.kr; Website: 
http: //biz.konkukkr/english/main1.asp. 


UNIVERSITY OF TENNESSEE, KNOXVILLE is accepting applications for a full-time, tenure- 
track position (Assistant or Associate Professor) to begin August 1, 2006. The position is for 
Information Management (Systems) within the Department of Accounting and Information Man- 
agement. Accordingly, candidates should hold a doctoral degree with a specialization in account- 
ing information systems, management information systems, or other comparable degree, or expect 
to complete all degree requirements by starting date. Please send applications to: Kenneth E. 
Anderson, Search Committee Chair, Department of Accounting and Information Management, 
CBA, University of Tennessee, Knoxville, Tennessee 37996-0560. Email: kea@utk.edu. UT Knox- 
ville is an EEO/AA/Title VI/Title IX/Section 504/ ADA/ADEA institution in the provision of 
its education and employment programs and services. The University welcomes and honors people 
of all races, genders, creeds, cultures, and sexual orientations, and values intellectual curiosity, 
pursuit of knowledge, and academic freedom and integrity. 
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204 VILLANOVA UNIVERSITY, Department of Accountancy invites applications at all levels for a 
tenure-track position. All research and teaching areas will be considered, but some priority may 
be given to teaching Financial Accounting or AIS. The candidate should be capable of high- 
quality research and teaching at both the undergraduate and graduate levels. A doctorate in Ac- 
counting is required. The Department of Accountancy's programs are AACSB-accredited and our 
graduates are vigorously recruited by professional services firms and financial institutions. Vil- 
lanova University is a Roman Catholic university, sponsored by the Augustinian Order. An AA/ 
EEO Employer, Villanova seeks a diverse faculty committed to scholarship, service, and especially 
teaching, who understand, respect, and can contribute to the University’s mission and values. 
Interested candidates should write to: Wayne G. Bremser, Ph.D., Chairperson, Department of 
Accountancy, College of Commerce and Finance, Villanova University, Villanova, PA 19085- 
1678, or apply by email (attach curriculum vitae): wayne.bremser@ villanova.edu. 


UNIVERSITY OF MICHIGAN, Ross School of Business has a tenure-track position available in 
Accounting starting in the 2006 academic year. The application should include a research paper 
based on the candidate's dissertation, any other published or unpublished research, evidence of 
teaching experience (if any), and a curriculum vita that includes three references. Applicants 
should have primary research and teaching interests in financial accounting. Applications should 
be sent by mail only to: Professor Patricia Dechow, Stephen M. Ross School of Business, Uni- 
versity of Michigan, 701 Tappan Street, Ann Arbor, MI 48109-1234. The University of Michigan 
offers competitive salary and fringe benefits and is an Affirmative Action/Non-Discriminatory 
Employer. 


UNIVERSITY OF MINNESOTA, Curtis L. Carlson School of Management invites applications 
for tenure-track faculty positions in accounting at all ranks. We offer a stimulating research 
environment with outstanding synergies across all accounting research paradigms. Salary and 
benefits are competitive. All applicants must possess a doctoral degree (or be close to completion), 
and must have a strong commitment to accounting research and teaching. Applicants for senior 
rank must have an established record of research publications in top-tier academic journals and 
must present evidence of excellence in teaching. Send applications, including a curriculum vita, 
samples of research papers, and at least one letter of recommendation from an established scholar 
to: Recruiting Conference Chair, Department of Accounting, Carlson School of Management, 
University of Minnesota, 321-19th Avenue South, Minneapolis, MN 55455. Review of applica- 
tions begins November 15, 2005 and will continue until positions are filled. The University of 
Minnesota is committed to the policy that all persons shall have access to its programs, facilities, 
and employment without regard to race, color, creed, religion, national origin, sex, age, marital 
status, disability, public assistance status, veteran status, or sexual orientation. 


GEORGETOWN UNIVERSITY, McDonough School of Business invites applications for tenure- 
track faculty positions at all levels for Fall 2006. Applicants must have (or be close to completing) 
8 Ph.D. in Accounting or a related field, and have a strong commitment to scholarly research and 
excellent teaching. Applicants should submit a letter of interest, curriculum vita, recommenda- 
tions, and samples of publications:or working papers to: Professor Patricia Fairfield, Georgetown 
University, McDonough School of Business, Washington, D.C. 20057. Georgetown University is 
an Affirmative Action/Equal Opportunity Employer. 
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208 ARIZONA STATE UNIVERSITY-EAST COLLEGE Business Administration program at the 
Polytechnic Campus seeks candidates for a nine-month tenure-track position at the Assistant or 
Associate Professor level in the area of Accounting. Please go to http:// www.eoaa.asu.edu/jobop/ 
jobop.asp for more details. Applications must include a letter of intent, curriculum vitae, the 
names, addresses, and telephone numbers of three references, and evidence of recent scholarly 
activity. Inquiries should be directed to: Dr. Timothy J. Richards, Morrison School of Agribusiness 
and Resource Management, Arizona State University East, 7001 East Williams Field Road, Mesa, 
AZ 85212; Phone: (480) 727-1488; Fax: (480) 727-1961; Email: trichards@asu.edu. Electronic 
applications and supporting materials must be sent to ProfSearch@asu.edu. Arizona State Uni- 
versity is an Equal Opportunity/ Affirmative Action Employer. A background check is required 
for employment. 


MIDWESTERN STATE UNIVERSITY Accounting Department invites applications for a tenure- 
track faculty position effective Fall 2006. A doctorate (or short-term ABD status) in accounting 
from an AACSB-accredited university or a terminal degree appropriate to the candidate's area of 
specialization and appropriate professional certification are required. A record of teaching excel- 
lence in Financial Accounting and a continuing record of scholarly research and publication are 
preferred. Appointment will be at a rank appropriate to qualifications. Compensation is compet- 
itive. Interested candidates should send a letter of application, current curriculum vitae, unofficial 
transcripts, and a list of three references to: Dr. Ralph Fritzsch, Chair, Department of Accounting, 
Dillard College of Business Administration, Midwestern State University, 3410 Taft, Wichita Falls, 
TX 76308. Website: http://business.mwsu.edu/accounting/index.asp; Email: ralph.fritzsch & 
mwsu.edu. EEO/ADA. 


UNIVERSITY OF TENNESSEE, KNOXVILLE is accepting applications for a full-time, tenure- 
track position (Assistant or Associate Professor) to begin August 1, 2006. The position is for 
Financial Accounting with an international and capital markets emphasis, within the Department 
of Accounting and Information Management. Accordingly, candidates should hold a doctoral 
degree with a specialization in these areas, or expect to complete all degree requirements by his 
or her starting date. Please send applications to: Kenneth E. Anderson, Search Committee Chair, 
Department of Accounting and Information Management, CBA, University of Tennessee, Knox- 
ville, TN 37996-0560. Email: kea@utk.edu. UT Knoxville is an EEO/AA/Title VI/Title ЇХ/ 
Section 504/ ADA/ ADEA institution in the provision of its education and employment programs 
and services. The University welcomes and honors people of all races, genders, creeds, cultures, 
and sexual orientations, and values intellectual curiosity, pursuit of knowledge, and academic 
freedom and integrity. 


UNIVERSITY OF NOTRE DAME, Department of Accountancy invites applications for position 
openings at the Assistant and Associate Professor levels. All research areas will be considered. 
Applicants for Assistant Professor positions should expect to complete doctoral degree require- 
ments before the start of the contract period. All candidates should have a strong commitment to 
scholarly research and excellence in teaching. Salary is very competitive and research support is 
excellent. Please send curriculum vitae and an example of scholarly work to: Professor Tom 
Schaefer, Chair, Department of Accountancy, Room 102D, Mendoza College of Business Ad- 
ministration, University of Notre Dame, Notre Dame, IN 46556-5646. The University of Notre 
Dame is an Equal Opportunity / Affirmative Action Employer. 
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212 SANTA CLARA UNIVERSITY, Department of Accounting is authorized to hire two tenure-track 
Assistant, Associate, or Full Professors for Fall 2007. Candidate’s primary teaching interest should 
be Financial Accounting, Managerial Accounting, or Information Systems. Candidates for Assis- 
tant Professor should have completed/nearly completed their Ph.D. and be committed to a schol- 
arly research program. Candidates at higher ranks should have a completed doctorate and a 
continuing record of nationally recognized scholarly research and quality teaching. Reviews of 
applications will begin October 1, 2006, but informal inquiries are welcome before that date. If 
you are interested in learning more about this position, contact: Dr. Michael Calegari, Department 
of Accounting, Leavey School of Business, Santa Clara University, Santa Clara, CA 95053-0380 
(or email: mcalegari@scu.edu). SCU is a private Jesuit university located in Silicon Valley with 
competitive salaries, housing subsidies, and summer research support SCU is an EEO/AA 
employer. 
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Earnings Management Using 
Classification Shifting: An Examination of 
Core Earnings and Special Items 


Sarah Elizabeth McVay 
New York University 


ABSTRACT: This paper examines the classification of Items within the Income state- 
ment as an earnings management tool. Evidence Is consistent with managers oppor- 
tunistically shifting expenses from core expenses (cost of goods sold and selling, gen- 
eral, and administrative expenses) to special Items. This vertical movement of expenses 
does not change bottom-line eamings, but overstates “core” eamings. In addition, it 
appears that managers use this earnings management tool to meet the analyst forecast 
earnings benchmark, as special items tend to be excluded from both pro forma and 
analyst earnings definitions. 


Keywords: eamings management; eamings components; special items; analyst 
forecasts. 


Data Availability: Data are avallable from public sources identified in the paper. 


I. INTRODUCTION 

arnings management, the misrepresentation or masking of true economic perform- 
ance, has been the focus of many papers. The bulk of this literature has focused on 
two general earnings management tools: accrual management and the manipulation 
of real economic activities. This paper examines a third potential earnings management tool 
that has been largely ignored to date: the deliberate misclassification of items within the 

income statement (herein referred to as classification shifting). 
I argue that managers wishing to maximize reported performance might shift expenses 
down (or revenue up) the income statement to present a picture that is not consistent with 


This paper is based on my dissertation at the University of Michigan. I am extremely grateful to the members of 
my dissertation committee, Russ Lundholm (Chair), Patty Dechow, Ilia Dichev, Lutz Kilian, and Tyler Shumway, 
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Anilowski, Jeffrey Doyle, Lucile Faurel, Natalie Gallery, Weili Ge, Helen Gernon, Venky Nagar, Karen Nelson, 
Scott Richardson, Joshua Ronen, Stephen Ryan, Cathy Shakespeare, Terry Shevlin (the editor), Doug Skinner, 
Mark Soliman, Irem Tuna, ard the University of Michigan doctoral students. The paper also benefited from the 
comments of workshop participants at the 2005 Accounting and Finance Association of Australia and New Zealand 
Annual Meeting, the 2005 American Accounting Association Annual Meeting, Columbia University, Emory Uni- ` 
versity, New York University, Northwestern University, Rice University, Stanford University, University of Chicago, 
University of Michigan, University of Oregon, University of Pennsylvania, University of Southern California, and 
University of Utah. I appreciate the research assistance of Kalin Kolev. An earlier version of this paper was entitled, 
“The Use of Special Items to Inflate Core Earnings.” 
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economic reality. Classification shifting is distinct from accrual management and the ma- 
nipulation of real activities in several ways. First, classification shifting does not change 
GAAP earnings; to the extent that financial statement users focus solely on GAAP earnings, 
classification shifting would be pointless. However, the individual components of the in- 
come statement are meant to be informative to financial statement users, facilitating analysis 
by grouping items with similar characteristics (FASB Accounting Concept No. 5). In gen- 
eral, the closer a line item is to sales, the more permanent this item tends to be (e.g., Lipe 
1986; Fairfield et al. 1996). Furthermore, investors appear to recognize this distinction and 
weight individual line items within the income statement differently (e.g., Lipe 1986; Elliott 
and Hanna 1996; Francis et al. 1996; Davis 2002; Bradshaw and Sloan 2002). 

Next, while all three methods of (income-increasing) earnings management raise ex- 
pectations of future performance, accrual management and the manipulation of real activ- 
ities also reduce earnings in future (or past) periods. In contrast, there is no “settling up” 
using classification shifting. Absent additional earnings management, next period’s earnings 
are equal to actual earnings, rather than earnings less the cost of earnings management in 
the prior period, greatly reducing the cost of this earnings management tool. Finally, GAAP 
net income does not change, thus limiting the scrutiny of auditors and regulators (Nelson 
et al. 2002). 

To document classification shifting, I focus on the allocation of expenses between core 
expenses (defined as cost of goods sold and selling, general, and administrative expenses) 
and special items. I posit that managers wishing to manage core earnings upward will shift 
expenses that should be classified as core expenses to special items.! As an anecdotal 
example of classification shifting? the SEC determined that Borden, Inc., classified $192 
million of marketing expenses as part of a restructuring charge when it should have been 
included in selling, general, and administrative expenses (Hwang 1994).3 

For a sample of 76,901 firm-year observations from 1989 to 2003, I decompose firms' 
core earnings into expected and unexpected components by modeling expected core earn- 
ings, in a similar vein as the accrual model (Jones 1991). I find that unexpected core 
earnings (reported core earnings less predicted core earnings) is increasing in special items.* 
This association is consistent with managers classifying core expenses as special items, 
increasing both core earnings and income-decreasing special items. 


! Alternatively, managers might shift revenue that should be classified as special items upward to be netted against 

core expenses. For example, IBM netted gains on asset sales against selling, general, and administrative expenses, 

presenting higher core earnings when instead the gains should have been broken out as special items (Bulkeley 
2002). 

Additional examples of classification shifting include AmeriServe Food Distribution (which, prior to filing for 

bankruptcy, classified substantial operating expenses as restructuring charges, allegedly to mask deteriorating 

financial performance [Sherer 2000]); SmarTalk (which reported a one-time charge in 1997 that improperly 
included 1997 operating expenses, enabling SmarTalk to falsely inflate 1997 earnings before one-time charges 

[SEC Accounting and Auditing Enforcement Release No. 1721]; Waste Management (which netted one-time 

gains against current-period operating expenses [SEC Accounting and Auditing Enforcement Release No. 1405]); 

and Anicom [which, in 1999, charged over $7.65 million in expenses to a one-time charge, including a bank 

fee incurred when Anicom violated its working capital loan covenants, a note receivable with accrued interest, 
charges incurred for enhancement to a computerized inventory control system, and over $2.3 million in accounts 
receivable write-offs [http:/ / www.sec.gov/litigation/complaints/complr17504.htm]). 

3 Classification shifting is a different use of special items to manage earnings than the interperiod shifting doc- 
umented in Burgstahler et al. (2002) and Moehrle (2002), who examine accrual management across time, using 
special items and restructuring charges, respectively. These studies do not examine shifting within the income 
Statement. 

3 Throughout the paper, positive special items are income-decreasing; Compustat #17 is multiplied by —1. I only 
examine income-decreasing special items in this paper. Classification shifting using income-increasing special 
items is left for future research. 


~ 


The Accounting Review, May 2006 


Earnings Management Using Classification Shifting 503 


While the above finding is consistent with classification shifting, it is also consistent 
with firms experiencing efficiency gains in the year the special item is recognized by stream- 
lining their operations or divesting. unprofitable lines of business; in doing so, these firms 
might experience an unexpected increase in core earnings. To distinguish between these 
competing explanations, I examine whether the unexpected core earnings in year 1 persist 
into year 1+1. A reversal of this improvement is more consistent with temporary earnings 
management, while the continued presence of this improvement is more consistent with 
economic improvements associated with the special item. I find that the unusually high 
core earnings associated with special items appear to reverse in the following period. 

The classification shifting documented in this paper appears to be significant econom- 
ically and, thus, a viable tool for managers. Based on my model of expected core earnings, 
І estimate that approximately 2.2 percent of reported special items, on average, are actually 
current-period operating expenses that are not transitory, but are opportunistically classified 
as special? On a per-firm-year basis across the entire sample, this translates into a mean 
shift of $287 thousand of recurring expenses to special items, an average of roughly one- 
half cent per share. For firms with income-decreasing special items of at least 5 percent of 
sales, the estimated mean shifted amount per firm-year is $1.66 million across 8,043 firm- 
years, translating into almost three cents per share. Moreover, there is ample opportunity 
to classification shift using special items; 31 percent of the observations in my sample 
recognize income-decreasing special items in any given year. 

It is important to note that my inferences of classification shifting rely on a model of 
core earnings. This is a necessary research design choice, as special-item firms tend to be 
performing extremely poorly. On average, these firms have lower core earnings than the 
Compustat population. To document that unexpected core earnings are increasing with 
special items requires strong performance controls. In order to adequately control for per- 
formance, I include current-year accruals. The inclusion of this variable introduces a pos- 
sible bias in the model, discussed in greater detail in Section V, and is a limitation of the 
model and therefore this study. 

To provide additional support of my inference of classification shifting, I conduct an 
array of supplementary tests. First, as with any earnings management mechanism, the net 
benefits are expected to be greater in some settings than in others. For example, Dechow 
and Skinner (2000) note that earnings management will likely be greater when the action 
allows managers to meet the analyst forecast when they otherwise would not. Consistent 
with classification shifting representing an earnings management tool, I find that classifi- 
cation shifting is more pervasive when it allows the manager to meet the analyst forecast, 
especially for growth firms (Skinner and Sloan 2002). The amount shifted increases to as 
much as 16.2 percent of special items. 

Next, I examine firms with special items in year t+1 and find that the "reversal" of 
unexpected core earnings is significantly muted for these firms, consistent with them once 
again having the opportunity to shift core expenses to special items and thus maintain the 
overstated core earnings. Third, I hand-collect the income statements and related footnotes 
of 190 S&P 500 firms that have income-decreasing special items of at least 5 percent of 
sales during 1996 to 2000. For this subsample, I split Compustat special items into two 


5 This paper does not argue that investors are naive or easily misled. It is possible that investors are able to 
determine that current-period core earnings are unexpectedly high, as is documented in this paper, however the 
financial statement users cannot ascertain why the core earnings are higher than expected. Managers can simply 
attribute the abnormal performance to economic improvements associated with the special items. In this paper, 
finding evidence that the unexpectedly high core earnings in year t reverses in year t+1 allows for ex post 
corroboration of earnings management. 
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groups: those that are amenable to classification shifting (such as other unusual charges) 
and those that are not (such as the loss on the sale of an asset). I confirm that only those 
special items amenable to classification shifting are associated with the unexpectedly high 
core earnings in year t and the subsequent reversal of this improvement in year t+1. Finally, 
I find that classification shifting is associated with negative returns in the subsequent year, 
suggesting that investors are surprised when expenses that were previously excluded from 
core earnings recur. 

Classification shifting offers a very attractive earnings management tool. In the context 
of special items, classification shifting can increase pro forma earnings substantially— 
nearly three cents per share for firms with income-decreasing special items of at least 5 
percent of sales—providing managers with a relatively low-cost tool to meet analyst fore- 
casts. This finding should be informative to analysts, investors, and regulators. Further- 
more, the shifting of core expenses to special items examined in this paper is just one of 
many possible classification-shifting schemes. Managers likely undertake classification 
shifting using other accounts, such as discontinued operations or research and develop- 
ment—an expense that is associated with future benefits. These actions do not change 
bottom-line earnings, but can have a significant impact on the expectations of investors and 
other financial statement users. 

The paper proceeds as follows. Section II provides some background and develops the 
hypotheses. Section Ш discusses the data and provides descriptive statistics. Section IV 
introduces the model of core earnings, and Section V describes the tests and results. Section 
VI concludes and offers avenues for future research. 


IL MOTIVATION AND HYPOTHESES 

Prior accounting research has documented two main methods of earnings management. 
The most commonly studied method is accrual management (e.g., Healy 1985; Jones 1991; 
McNichols and Wilson 1988; Rangan 1998; Teoh et al. 1998; Phillips et al. 2003). Essen- 
tially, a manager can borrow earnings from future periods, through the acceleration of 
revenues or deceleration of expenses, in order to improve current earnings. In addition to 
the cost of detection, this method of earnings management bears a one-to-one cost of 
earnings reduction in the future; future-period earnings will be mechanically lower by the 
net income that was accelerated to current earnings." 

A second type of earnings management can occur through the manipulation of real 
activities, such as providing price discounts to increase sales and cutting discretionary 
expenditures, such as R&D, to manage earnings (e.g., Baber et al. 1991; Dechow and Sloan 
1991; Bushee 1998). Such actions can increase revenues or net income, but they are also 
costly. For example, cutting R&D spending to manage earnings may result in the loss of 
future income related to the forgone R&D opportunities. On the other hand, because the 


$ The evidence in this paper supports prior research on pro forma earnings that suggests managers exclude 
recurring expenses (Doyle et al. 2003). Moreover, while prior research suggests managers use pro forma earnings 
to meet the analyst forecast (e.g., Lougee and Marquardt 2004; Doyle and Soliman 2005), this paper presents 
evidence on how managers might undertake this activity. 

7 Managers can also use accrual management to overstate current-period expenses (ї.е., take a “big bath"). The 
overstatement can be used to offset future operating expenses (Burgstaliler et al. 2002), or reserves associated 
with restructuring charges can be reversed into income in future periods (Moehrle 2002). The cost of detection 
from this type of earnings management is lower than that of the more traditional accrual management. For 
example, Nelson et al. (2002) find that the use of reserves is a common type of earnings management that is 
often left unadjusted by auditors if detected. However, the benefits of this type of earnings management are also 
delayed because it improves future net income. 
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manipulation of real activities is not a GAAP violation, this earnings management tool is 
expected to have a lower cost of detection than accrual management. 

Again, a third potential earnings management tool is the misclassification of items 
within the income statement (classification shifting). Classification shifting bears a relatively 
low cost: there is no accrual that later reverses, nor are there lost revenues from forgone 
opportunities. Moreover, because the allocation of expenses to specific accounts can be 
subjective, auditors might be limited in their ability to verify the appropriate classification, 
and, because bottom-line income does not change, they might expend less energy on the 
identification or compulsory adjustments of these accounts (Nelson et al. 2002).5 

Prior research supports the viability of classification shifting. First of all, without ac- 
tually misclassifying expenses, managers have been shown to manipulate the presentation 
of the income statement in order to influence perceptions of performance. For example, 
Kinney and Trezevant (1997) show that managers are far more likely to break out income- 
decreasing special items than income-increasing special items on the face of the income 
statement, consistent with managers wishing to highlight the transitory nature of expenses, 
but not income. This strategic reporting extends to earnings press releases (Schrand and 
Walther 2000; Bradshaw and Sloan 2002; Bowen et al. 2005). Finally, Davis (2002) finds 
some evidence that managers of Internet firms gross up both revenues and costs of sales 
to maximize reported revenues, an especially relevant metric for these firms. 

In terms of classification, Dye (2002) presents a model where managers attempt to 
secure the preferred accounting classification (within GAAP) of both real transactions (such 
as operating versus capital leases) and income statement classifications (such as classifying 
transitory gains as ordinary income). Empirically, Barnea et al. (1976) conclude that man- 
agers use their subjectivity over the classification of borderline expenses that prior to APB 
No. 30 could be classified as ordinary or extraordinary (such as the loss on the sale of an 
asset) to present smoother ordinary earnings. Givoly et al. (1999) examine the allocation 
of income across segments and find evidence that suggests managers shift income to the 
highest P/E segments, thereby maximizing expected firm value. Weiss (2001) examines the 
treatment of transitory earnings shocks associated with the 1993 increase in corporate 
tax rates, and finds that managers are more likely to highlight the income-decreasing ef- 
fects (consistent with Kinney and Trezevant 1997) and offset current transitory gains with 
income-decreasing special items in an attempt to maximize future core earnings.? ; 

The above research lends credence to the idea that the placement and presentation of 
expenses within tbe income statement is a valid earnings management tool for managers. 
In this paper, I posit that managers intentionally misclassify expenses within the income 
statement—a GAAP violation. Specifically, Í argue that managers classify a portion of core 
operating expenses as special items in the year that a special item is recognized. This action 
can alter the perceptions of financial statement users because different income statement 


Each of the above earnings: management mechanisms affects the actual income statement filed with the SEC. 
Alternatively, managers might simply exclude recurring expenses from the non-GAAP earnings numbers reported 
in their press releases (e.g., Doyle et al. 2003; Doyle and Soliman 2005). Clearly this mechanism has a lower 
cost, but it might also have a lower benefit. While including core expenses in special items offers some degree 
of camouflage, some pro forma exclusions can be very transparent and thereby treated as recurring by investors. 
For example, anecdotally, non-special-item exclusions include stock-based compensation expenses, payroll taxes 
on stock option exercises, and amortization costs. Along these lines, Gu and Chen (2004) find that both First 
Call analysts and investors appear to include some items in their core earnings calculations that managers 
excluded in press releases. 

Similarly, Cameron and Gallery (2001) find that when firms report income-increasing abnormal items, they tend 
to recognize offsetting incame-decreasing abnormal items. 
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classifications have different information content for future earnings. Moreover, investors 
appear to understand the distinction and do not treat line items homogeneously. 

To document classification shifting, I focus on the shifting of expenses between core 
expenses (cost of goods sold and selling, general, and administrative expenses) and special 
items.!° There are many possible misclassifications to special items. Large charges, such as 
those related to restructurings or mergers, offer a great deal of latitude and camouflage. For 
example, managers can classify normal severance charges as charges resulting from the 
restructuring or merger. A manager might also allocate a greater percentage of legal costs 
or other administrative expenses than were actually related to the restructuring or merger 
to the “special” fees. Classification shifting is not, however, limited to these large charges. 
Many unusual charges might contain misclassified core expenses. Consider Y2K expenses, 
which might contain the salaries of permanent information technology personnel, or a 
litigation gain or loss might contain day-to-day legal fees. 

The appropriate categorization of expenses may not be clear-cut to auditors or other 
outside monitors. While the substantive procedures of auditors might identify unrecorded 
expenses, in this setting it is not the recognition, but merely the classification, that is in 
question. The documentation for expenditures might be too general for the auditor to de- 
termine the appropriate classification. Furthermore, while auditors look for abnormal fluc- 
tuations in margins and other ratios, special-item firms tend to be declining in performance 
(see Figure 1); while reported core earnings might be higher than actual core earnings, the 
reported earnings are likely still below prior-period and industry benchmarks. 

Focusing on classification shifting between core expenses and special items offers a 
powerful test of classification shifting for several reasons. First, core expenses and special 
items are clearly distinct; core expenses tend to be relatively stable, while special items are 
by definition unusual or infrequent. Special items have been shown to be highly transitory 
(e.g., Lipe 1986; Fairfield et al. 1996), and are treated accordingly by investors (e.g., Lipe 
1986; Bradshaw and Sloan 2002). Therefore, classification shifting between core expenses 
and special items, if not fully disentangled by financial statement users, can impact expec- 
tations and thus prices. Moreover, special items tend to be excluded from core earnings by 
both managers (e.g., Lougee and Marquardt 2004) and analysts (e.g., Philbrick and Ricks 
1991), and thus classification shifting from core expenses to special items could result in 
managers ex post meeting the analyst forecast when they otherwise would not have met 
this benchmark. Finally, as noted above, the shifting of expenses between core expenses 
and special items is viable; managers have subjectivity over the classification of expenses, 
and the shifting is not expected to raise red flags for outside monitors. This leads to my 
first hypothesis: 


H1: Managers classify core expenses as special. 


10 Specifically, I focus on the association between core earnings and special items. Special items are material events 
that arise from a firm's ongoing, continuing activities, but that are either unusual in nature or infrequent in 
occurrence—but not both—and must be disclosed as a separate line item as part of income from continuing 
operations, or in footnotes to the financial statements (Revsine et al. 2005, 55). Examples of special items 
include (1) write-downs or write-offs of receivables, inventories, equipment, or intangibles, (2) gains or losses 
from the sale of equipment or investments, and (3) special one-time charges resulting from corporate restruc- 
turings. Compustat combines special items that are broken out on the income statement with those that are 
solely disclosed in the footnotes. However, successful classification shifting does not require special items to be 
highlighted on the income statement, as managers may highlight footnoted special items in the press release 
and/or analysts may exclude these charges from pro forma earnings. For example, in 2001, Cisco Systems Inc. 
reported a $2.77 billion inventory write-off, which they included in cost of goods sold within the income 
statement, but which was removed from pro forma earnings in their press release. 
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In particular, I expect unexpected core earnings to be increasing in special items in year t. 
Furthermore, to ensure that this is due to classification shifting rather than an economic 
improvement associated with the special item, I expect this improvement to "reverse" in 
year {+1 as the core expenses excluded in year t recur in year t+1. 

Clearly there are costs to classification shifting. In addition to the cost of detection, in 
general, managers want to avoid raising future expectations of investors or other parties. 
Therefore, I expect managers to classification shift to a greater degree in periods when the 
benefits to shifting are greater (holding costs to shifting constant). Benefits are expected to 
be particularly high when the earnings management allows the manager to meet earnings 
benchmarks (Dechow and Skinner 2000). The consensus analyst forecast has become the 
most important earnings benchmark in recent years (Dechow et al. 2003; Brown and Caylor 
2005), and this benchmark typically excludes special items and other nonrecurring charges 
(Philbrick and Ricks 1991). 

Moreover, meeting the analyst forecast has been shown to affect capital markets. Firms 
that meet analyst forecasts have an equity premium (Bartov et al. 2002; Kasznik and 
McNichols 2002), and missing the analyst forecast can result in a large decline in stock 
price, especially for high-growth firms (Skinner and Sloan 2002). Thus, the benefits to 
classification shifting are presumably greater for managers who can use their discretion 
to meet the analyst forecast, especially in high-growth firms. This leads to my second 
hypothesis: 


H2: Managers classify more core expenses as special in periods when the net benefits 
to classification shifting are expected to be greater. 


In particular, I expect classification shifting to be more pervasive when the shifting allows 
managers to meet the consensus analyst forecast, especially in high-growth firms. 


Ш. DATA, SAMPLE SELECTION, AND DESCRIPTIVE STATISTICS 

Data and Sample Selection 

Data are obtained for the years 1988 to 2003 from the 2003 Annual Compustat File, 
I/B/E/S Split-Unadjusted File, and CRSP Daily Return Tapes.!! Each firm-year observation 
is required to have sufficient data to test H1. Observations with sales of less than $1 million 
are deleted to avoid the creation of outliers, as sales is used as a deflator for the majority 
. of the variables. Also, firms that had a fiscal-year-end change from t— 1 to t or from t to 
t+1 are deleted to help ensure that years are comparable. Finally, I require a minimum of 
15 observations per industry per fiscal year in order to ensure a sufficiently large pool to 
estimate expected core earnings. Industries are classified following Fama and French 
(1997); results are not sensitive to the number of required observations or the industry 
classification scheme. The full sample has 76,901 firm-year observations. 


п As my measures require one year of lagged data and one year of future data, the actual years examined are 
1989-2002. The sample begins in 1988, after Compustat began reporting the newly required Cash From Op- 
erations. This allows me to calculate Accruals as Earnings less Cash From Operations as prescribed by Hribar 
and Collins (2002). This is particularly important in my setting because accruals are measured with error using 
the balance-sheet approach, especially for firms that have had Merger and Acquisition (M&A) activity (Hribar 
and Collins 2002), and special items often arise as a result of M&A activity. I assign a zero to special items 
(#17) if that data item is missing, consistent with Elliott and Hanna (1996). I also re-estimate my main analysis 
after excluding observations with missing special items. Results are not sensitive to this alternative approach. 
In addition, I assign a zero to extraordinary items and discontinued operations from the statement of cash flows 
(#124), which is used to calculate accruals, if that data item is missing. 
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Descriptive Statistics 

Table 1 provides descriptive statistics for the main variables. The mean (median) core 
earnings for all firms, which by definition is scaled by sales, is 0.070 (0.107).? Mean 
income-decreasing special items as a percentage of sales is approximately 2.7 percent— 
income-increasing special items are not included in the analysis and have been set to zero. 

Table 2 compares firms with and without large income-decreasing special items, where 
large is defined as 5 percent of sales. Not surprisingly, firms that recognize large income- 
decreasing special items have significantly lower core earnings than those firms that do not 
(—0.140 versus 0.095). Firms that undertake large write-offs or corporate restructurings 
tend to be poor performers (Elliott and Shaw 1988; DeAngelo et al. 1994; Carter 2000). 

The change in core earnings from year t—1 to year t is significantly more negative for 
large special-item firms, suggesting that, as conditions worsen, the need for a special item 
such as an asset write-off or a restructuring increases. The change in core earnings from 
year t to 1+1 (i.e., the vear following the special item) is significantly more positive for 
large special-item firms (0.036 versus —0.007). This change is graphically illustrated in 
Figure 1 and is consistent with Brooks and Buckmaster (1976), who document that mean 
reversion is stronger for firms with extreme performance. Accruals are significantly more 
negative for large special-item firms, consistent with DeAngelo et al. (1994). Finally, the 
analyst forecast error is significantly more negative for large special-item firms, consistent 
with Abarbanell and Lehavy (2002). Table 3 provides correlations among the main variables 
and is discussed in the following section. 


IV. MEASURING CLASSIFICATION SHIFTING 

In Section II, I hypothesize that managers shift core expenses to special items; in this 
section, I develop a methodology to measure classification shifting. I expect core earnings 
of special-item firms to be overstated in the year the special item is recognized. I model 
the level of core earnings, and anticipate unexpected core earnings (reported core earnings 
less predicted core earnings) in year f to be increasing with special items in year t if 
managers are classification shifting. As discussed above, an alternative explanation for this 
association is that core earnings are unexpectedly high due to the immediate benefits of the 
restructuring charge or some other real economic event. In order to distinguish between 
real economic changes and the opportunistic behavior of managers, I examine whether the 
improvement associated with special items in year t reverses in year #+1.13 

To test this part of the hypothesis, I model the change in core earnings. I expect the 
unexpected change in core earnings from year г to t+1 to be declining in special items in 
year 1. Thus, operationally, I expect firms that classification shift to have both (1) a higher 
than expected level of core earnings in year г and (2) a lower than expected change in core 


12 Core earnings is defined as operating income before depreciation and special items scaled by sales. Depreciation 
. is excluded to avoid the accounting effect of an asset write-down, which mechanically reduces depreciation 

expense. Sales is used as the scalar, rather than total assets, because assets might be systematically misstated 
for special-item firms. Thus, core earnings is calculated as [(Sales — Cost of Goods Sold — Selling, General, 
and Administrative Expenses)/Sales], where Cost of Goods Sold and Selling, General, and Administrative 
Expenses exclude Depreciation and Amortization, as determined by Compustat. 

15 As previously touched upon, if there was a real economic improvement, it is reasonable to expect the core 
earnings to remain at the improved level, all else equal, while mean reversion would predict a continued upward 
trend on the core earnings, as special-item firms tend to have core earnings that are below average (see Figure 
1). On the other hand, if the unexpectedly high core earnings in the year of the special item were realized by 
shifting some core expenses to the special item, then special items in year г (which include core operating 
expenses) should be associated with a systematic decline in core earnings from year г to t+1, all else equal, as 
the previously excluded core expenses recur. 
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TABLE 1 


Panel A: Descriptive Statistics for the Full Sample 


Standard 
Variable |... . Mean Median Deviation 259 || 759 
SALES, (in millions) 1,566.370 132.521 7,263.780 27.746 641.566 
PERCENT CHANGE IN SALES, , , 21.6% 9.0% 0.567  —1.896 27.096 
CORE EARNINGS, 0.070 0.107 0.385 0.039 0.196 
ACORE EARNINGS, , , 0.011 0.001 0.248 | —0.027 0.026 
ACORE EARNINGS, ал —0.003 0.000 0.210 —0.029 0.025 
UNEXPECTED CORE EARNINGS, 0.001 0.003 0.148 —0.033 0.044 
UNEXPECTED CHANGE IN CORE 0.001 0.001 0.133 —0.033 0.038 
EARNINGS, „+ 
INCOME-DECREASING SPECIAL 13.062 0.000 79.669 0.000 0.627 
ITEMS, (in millions) 
INCOME-DECREASING SPECIAL 2.7% 0.0% 0.114 0.0% 0.6% 
ITEMS AS A PERCENT OF SALES, 
ACCURALS,*. -0104 —0.049 0.282 -0124  —0.002 
ASSET TURNOVER RATIO, 2.82 1.93 3.36 1.06 3.22 
ANALYST FORECAST ERROR, —0.016 0.000 0.093 —0.005 0.002 
Panel B: Variable Definitions with Corresponding Compustat Data Item Numbers 
Variable Definition 
CE, Core Earnings (before Special Items and Depreciation), calculated as (Sales 


— Cost of Goods Sold — Selling, General, and Administrative Expenses) (#13) 
/Sales (#12), where Cost of Goods Sold and Selling, General, and 
Administrative Expenses exclude Depreciation and Amortization, as determined 


я by Compustat. 
АСЕ, Change in Core Earnings, calculated as CE,,, — СЕ, 
UE_CE, Unexpected Core Earnings is the difference between reported and predicted Core 


Earnings, where the predicted value is calculated using the coefficients from 
model (1), as follows, estimated by fiscal year and industry and excluding firm 
ЈЕ СЕ, = Во + ВСЕ, + B;ATO, + BACCRUALS, , + B,ACCRUALS, 
+ B.ASALES, + В„МЕС .ASALES, + ёс 

UE_ACE, Unexpected Change in Core Earnings is the difference between reported and ` 
predicted Change in Core Earnings, where the predicted value is calculated 
using the coefficients from model (2), as follows, estimated by fiscal year and: 
industry and excluding firm i: ACE, = фо + $,CE,_, + &ACE, , + 6,AATO, 
+ $,ACCRUALS, , + $ ACCRUALS, + $ASALES, + NEG. ASALES, + у, 


9651, Income-Decreasing Special Items as a Percentage of Sales, calculated as [Special 
Items, (#17) x —1]/Sales,#12)> when Special Items are income-decreasing, 
and 0 otherwise. 

ASALES, Percent Change in Sales, calculated as (Sales, (#12) - Sales, ,)/Sales, ,. 


МЕС ASALES, Percent Change in Sales (ASALES,) if ASALES, is less than 0, and 0 otherwise. 
-ACCRUALS, Operating Accruals, calculated as [Net Income before Extraordinary Items (#123) 
— Cash From Operations (#308—#124)]/ Зајез (#12). 


(continued on next page) 
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TABLE 1 (Continued) 


ATO, Asset Turnover Ratio, defined as Sales, (#12)/((NOA, + NOA,_,)/2), where 
NOA, or Net Operating Assets, is equal to the difference between Operating 
Assets — Operating Liabilities. Operating Assets is calculated as Total Assets 
(#6) less Cash (#1) and Short-Term Investments (#32). Operating liabilities is 
calculated as Total Assets (#6) less Total Debt (#9 and #34), less Book Value 
of Common and Preferred Equity (#60 and #130), less Minority Interests 
(#38). Average net operating assets is required to be positive. 


AATO,. Change in Asset Turnover, calculated as ATO, — АТО, 1. 

FE, Analyst Forecast Error for year 7, defined as Actual Earnings as reported by 
I/B/E/S less the Median I/B/E/S Analyst Forecast. 

MB, Market-to-Book Ratio, defined as Market Value (#25 X #199) divided by Book 
Value (#60). 


The subscript t represents the year and all variables are firm-specific. All variables are winsorized 
by year at the extreme 1 percent and 99 percent. 


There are a maximum of 76,901 firm-year observations for each variable. Core Earnings is before special items 
and depreciation and defined as ((Sales — Cost of Goods Sold — Selling, General, and Administrative Expenses)/ 
Sales), where Cost of Goods Sold and Selling, General, and Administrative Expenses exclude Depreciation and 
Amortization, as determined by Compustat. Unexpected Core Earnings in year / and Unexpected Change in Core 
Earnings from year г to t+1 are the differences between reported and predicted Core Earnings and Change in 
Core Earnings, respectively, where the predicted values are calculated using the coefficients from models (1) and 
(2) (shown below), estimated by fiscal year and industry and excluding firm i: 


CE, = Во + B,CE,-, + В-АТО, + BjACCRUALS,., + B,ACCRUALS, + BASALES, 


+ B,NEG_ASALES, + е, (1) 
ACE, = фо + ф,СЕ,_, + фАСЕ, , + ф,ЛАТО, + Ф.АССЕИА15, | + bsACCRUALS, 
+ b,ASALES, + $9NEG..ASALES, + v, (2) 


* Note that accruals are scaled by sales. For comparison purposes, I also calculate accruals scaled by beginning 
of period assets. The mean (median) is —0.054 (—0.049), consistent with the magnitudes in prior research. 

* Note that I multiply special items by —1 in order to be consistent with the general term special items 
representing an income-decreasing item. 


earnings in year t-- 1. This prediction is opposite to what is expected to occur in the normal 
course of business as a result of special items. Referring to Figure 1, reported core earnings 
for large income-decreasing special-item firms fall, on average, in the year the special item 
is recognized, and improve, on average, in the next year. 

I develop a model of expected core earnings, first in levels (to examine year г) and 
then in changes (to examine year t+1). This model attempts to control for economic per- 
formance as well as for macroeconomic and industry shocks. To model the level of, and 
change in, core earnings (CE), I estimate the following models, respectively. Regressions 
are estimated by industry and fiscal year: 


СЕ, = B, + B&,CE, , + B,ATO, + BJACCRUALS, | + BACCRUALS, 


+ B,ASALES, + B,NEG. ASALES, + £, (1) 
ACE, = ф, + ф,СЕ, , + Ф,АСЕ, , + $,AATO, + &,ACCRUALS, , | 
+ bsACCRUALS, + ,ASALES, + ,NEG_ASALES, + у, (2) 


where each of the variables is described below. Table 1, Panel B, presents the calculation 
and corresponding Compustat data item numbers of each of the variables. 
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In the levels model (model 1), my first variable is lagged core earnings (CE,_,). I 
include this variable because core earnings tends to be very persistent (note the correlation 
of 0.80 between core earnings and lagged core earnings in Table 3). Next, I :nclude the 
asset turnover ratio (ATO,), as it has been shown to be inversely related to profit margin 
(e.g., Nissim and Penman 2001), and my definition of core earnings closely parallels profit 
margin. Note the negative correlation between core earnings and asset turnover in Table 3, 
consistent with the studies referenced above. For the purpose of this paper, the inclusion 
of the asset turnover ratio is also important because firms that have large income-decreasing 
special items are likely to be making changes to their operating strategy, possibly altering 
their mix of margin and turnover. 

Sloan (1996) finds that, holding earnings constant, accrual levels are an explanatory 
variable for future performance. Specifically, earnings performance attributable to the ac- 
crual component of earnings exhibits lower persistence than earnings performznce attrib- 
utable to the cash flow component of earnings. Thus, I include prior-year operating accruals 
(ACCRUALS,_,) in my model of core earnings. 

I also include current-year accruals (ACCRUALS,) in my model. Extreme performance 
is highly correlated with changes in accrual levels (DeAngelo et al. 1994). Specifically, 
unusually good performance is associated with a large increase in accruals, and unusually 
poor performance is associated with a large decline in accruals. While it is pcssible that 
extreme accruals could be due to accrual management, this paper focuses oa earnings 
management using special items, and therefore controlling ior accruals, discretionary or 
otherwise, allows for a stronger prediction of core earnings.!^ 

Although core earnings is scaled by sales, the relation is not expected to be constant 
because, as sales grow, fixed costs become smaller per sales dollar. Therefore. I include 
sales growth (ASALES,) as an explanatory variable. I also allow the slope to differ between 
sales increases and decreases (NEG. ASALES,) because Anderson et al. (2003) find that 
costs increase more when activity rises than they decrease when activity falls by an equiv- 
alent amount. 

To model the change in core earnings (model 2), I include both lagged сога earnings 
(CE, |) and the change in core earnings from year t—2 to 1-1 (ACE, ,) to allow the model 
to vary the degree of mean reversion based on the prior-year’s level of core earnings. This 
is important because mean reversion is typically more extreme in the tails (e.g., Freeman 
et al. 1982; Fama and French 2000). Inclusion of both levels and changes is also consistent 
with prior literature that forecasts changes in profitability (e.g., Fama and Freach 2000; 
Fairfield and Yohn 2001; Penman and Zhang 2002). I replace the level of asset turnover 
with the change in asset turnover (ДАТО,), and retain ACCRUALS, ,, ACCRUALS,, 
ASALES,, and NEG. ASALES,. 

Models (1) and (2) are estimated cross-sectionally by industry and fiscal yezr. Table 4 
provides the mean and median regression results for the model of expected core earnings. 
The median adjusted К? is quite high, at approximately 78 percent, and ranges by industry 
from 58 percent for Aircrafts to 94 percent for Wholesale (nct tabulated). 

Referring to Table 4, for the median regression, prior-year core earnings (СЕ, |) is a 
strong predictor of core earnings, as expected, with a coefficient of 0.74 and median p- 
value of less than 0.0001. The asset turnover ratio (ATO,) is weakly significant (one-tailed 
p-value — 0.087) for the median regression, and is only significant in 36 percent of the 
599 industry-year regressions. Thus, within industry-years, the asset turnover ratio does not 


14 While I include accruals to control performance, the inclusion of this variable results in a possible bias, which 
I discuss in detail in Section V. 
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TABLE 4 
Model of Expected Core Earnings—Levels 


Dependent Variable: CE, 


Mean Median Percent Percent with 
Coefficient Coefficient Significant Sign in the 
Independent Predicted (one-tailed (one-tailed (p-value = 0.10, Predicted 
Variables Sign p-value) p-value) one-tailed test) Direction 
Intercept 0.04 0.03 
(0.123) (0.044) 
CE, , * 0.74 0.74 99.7 99.8 
(0.001) (0.000) 
ATO, = —0.003 —0.002 36.2 66.6 
(0.101) (0.087) 
ACCRUALS,_, = —0.18 —0.18 70.6 848 , 
(0.045) (0.005) 
ACCRUALS, + 0.22 0.18 74.0 81.5 
(0.041) (0.003) 
ASALES, + 0.06 0.05 58.4 78.5 
| (0.062) (0.023) 
NEG_ASALES, + 0.27 0.21 62.9 78.8 
(0.058) (0.019) 
Adjusted R? 75.5% 78.1% 


There are 76,901 observations and 599 industry-year regressions. Regressions are estimated by industry and 
fiscal year, and the p-values shown are based on one-tailed tests for each of the independent variables with the 
exception of the intercept, which does not have a sign prediction. p-values, rather than t-statistics, are provided 
due to the varying sample sizes of the specific regressions (which range from 15 to 851 observations). CE, is 
Core Earnings, calculated as (Sales, — Cost of Goods Sold, — Selling, General, and Administrative Expenses,)/ 
Sales, where Cost of Goods Sold and Selling, General, and Administrative Expenses exclude Depreciation and 
Amortization, as determined by Compustat. АТО, is the asset turnover ratio, defined as Sales,/((NOA, + NOA,_,)/ 
2), where NOA, or Net Operating Assets, is equal to Operating Assets — Operating Liabilities. Operating Assets 
is calculated as Total Assets — Cash and Short-Term Investments. Operating Liabilities is calculated as Total 
Assets — Total Debt — Book Value of Common and Preferred Equity — Minority Interests, where average net 
operating assets is required to be positive. ACCRUALS, is Operating Accruals, calculated as [(Net Income before 
Extraordinary Items — Cash From Operations) /Sales]. " ASALES, is the percentage change in sales from year t—1 
to t (Sales, — Sales, ,)/ (Sales, ,). NEG. ASALES, is ASALES, if ASALES, is negative, and 0 otherwise. 

All variables are winsorized at 1 percent and 99 percent. 

See Table 1, Panel B, for Compustat data item numbers. 


appear to be strongly associated with core earnings. This association remains relatively 
weak in univariate industry-year regressions; 49 percent of the 599 regressions are statis- 
tically significant (not tabulated). These weak within-industry results are consistent with 
Soliman (2004), who finds that the negative relation between asset turnover and profit 
margin is largely driven by industry association. 

Prior-year accruals (ACCRUALS,_,) has a coefficient of —0.18, consistent with higher 
levels of accruals having lower earnings persistence. The positive coefficient of 0.18 on 
current-year accruals (ACCRUALS,) is also as predicted. Consistent with Anderson et al. 
(2003), the slope coefficient on sales growth (ASALES,) is significantly larger for firms that 
experience a sales decline (0.05 versus 0.26, where 0.26 is obtained by summing В, and 


. Table 5 presents the mean and median regression results for Equation (2), the model 
of change in core earnings. The median adjusted R? is 51.6 percent. For the median 


The Accounting Review, May 2006 


516 McVay 





TABLE 5 
Model of Expected Core Earnings—Changes 


Dependent Variable: ACE, 


Mean Median Percent Percent with 
Coefficient Coefficient Significant Sign in the 

Independent Predicted (one-tailed (one-tailed ^ (p-value = 0.10, Predicted 

Variables Sign p-value) p-value) one-tailed test) Direction 

Intercept 0.02 0.02 
(0.271) (0.121) 

СЕ, - —0.16 —0.16 73.5 85.6 
(0.044) (0.001) 

ACE, , = —0.08 —0.07 63.3 63.6 
(0.057) (0.023) 

AATO, + 0.004 0.003 41.1 64.1 
(0.094) (0.074) | 

ACCRUALS,_, - -0.17 —0.15 72.5 870 . 
(0.045) (0.007) 

ACCRUALS, + 0.20 0.19 75.5 85.5 
(0.039) (0.001) 

ASALES, + 0.06 0.04 52.1 75.3 
(0.073) (0.044) 

NEG .ASALES, + 0.25 0.20 62.6 79.6 
(0.056) . (0.015) 

Adjusted R? 51.7% 51.6% 


There are 76,901 observations and 599 industry-year regressions. Regressions are estimated by industry and 
fiscal year, and the p-values shown are based on one-tailed tests for each of the independent variables with the 
exception of the intercept, which does not have a sign prediction. p-values, rather than t-statistics, are provided 
due to the varying sample sizes of the specific regressions (which range from 15 to 851 observations). CE, is 
Core Earnings, calculated as (Sales, — Cost of Goods Sold, — Selling, General, and Administrative Expenses,)/ 
Sales, where Cost of Goods Sold and Selling, General, and Administrative Expenses exclude Depreciation and 
Amortization, as determined by Compustat. ATO, is the asset turnover ratio, defined as Sales,/((NOA, + и 
2), where МОА, or Net Operating Assets, is equal to Operating Assets — Operating Liabilities. Operating Assets 

is calculated as Total Assets — Cash and Short-Term Investments. Operating Liabilities is calculated as Total 
Assets — Total Debt — Book Value of Common and Preferred Equity — Minority Interests, where average net 
operating assets is required to be positive. ACCRUALS, is Operating Accruals, calculated as [(Net Income before 
Extraordinary Items — Cash From Operations)/Sales]. ASALES, is the percentage change in sales from year t~1 
to t (Sales, — Sales, ,)/(Sales,.,). МЕС. ASALES, is ASALES, if ASALES, is negative, and 0 otherwise. 

All variables are winsorized at 1 percent and 99 percent. 

See Table 1, Panel B, for Compustat data item numbers. 


industry-year regression, all of the variables are statistically significant (p-value « 0.10, 
one-tailed) and have the predicted signs. The level of core earnings is negatively associated 
with the change in core earnings, consistent with mean reversion (Freeman et al. 1982). 
The change in core earnings from year 1—2 to 1—1 15. also negatively associated with the 
change in core earnings from 1—1 to 1, consistent with Brooks and Buckmaster (1976). The 
change in the asset turnover ratio is weakly significant in the mean and median regressions; 
the sign is consistent with the relation found in Penman and Zhang (2002). However, in 
the majority (59 percent) of the 599 industry-year regressions, this variable is not statisti- 
cally significant. 

Unexpected core earnings and unexpected change in core earnings are the differences 
between reported and predicted core earnings and change in core earnings, respectively. 
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The predicted values are calculated using the coefficients from models (1) and (2) above, 
estimated by fiscal year and industry and excluding firm i. Tables 1—3 provide descriptive 
statistics for these residuals. Referring to Table 3, core earnings and unexpected core earn- 
ings are positively correlated (0.41), as are the partitions of the discretionary accrual model. 
This positive association potentially confounds many studies of earnings management be- 
cause both partitions (i.e., discretionary and nondiscretionary) are expected to be correlated 
in the same direction with the variable of interest (e.g., McNichols and Wilson 1988; 
Dechow et al. 1995; McNichols 2000). However, it is important to note that special items 
are negatively correlated with core earnings, while H1 posits a positive relation between 
special items and unexpected core earnings. 


V. TEST DESIGN AND RESULTS 

Main Analysis 

Hypothesis 1 predicts that managers shift core expenses to special items ("classification 
shift"). As discussed in Section IV, if managers classification shift, then unexpected core 
earnings in year t is expected to be increasing with special items in year t, and the unex- 
pected change in core earnings in year t+1 is expected to be decreasing with special items 
in year і. These anticipated associations are opposite those expected for actual core earnings, 
as illustrated in Figure 1. To test H1, I estimate the following regressions: 


UE CE, = o, + o,968I, + е, (3a) 
UE АСЕ,,ү = то + NPSI, + у, (3b) 


where СЕ. СЕ, is unexpected core earnings in year t, and UE .ACE,,, is unexpected change 
in core earnings in year 1+1, the difference between reported and predicted core earnings 
and change in core earnings, respectively, where the predicted values are calculated using 
the coefficients from models (1) and (2) above, estimated by fiscal year and industry and 
excluding firm i. The variable 9551, is defined as income-decreasing special items scaled 
by sales, both in year t. Note that a positive special item corresponds to an income- 
decreasing special item (income-increasing special items are set to zero); I therefore predict 
a, to be positive and т, to be negative. Recall that many other variables were used in the 
generation of unexpected core earnings, my dependent variable. As such, I do not add 
additional control variables to Equations (3a) and (3b). In untabulated tests I include 
industry-specific indicator variables; results are not sensitive to the inclusion of these 
variables. 

I consider three samples in Table 6, (1) all Compustat firms, (2) those firms with non- 
zero income-decreasing special items, and (3) those firms with income-decreasing special 
items of at least 5 percent of sales, presented in the first, second, and third columns of 
results, respectively. Referring to the first column in Table 6, I find that special items are 
positively associated with unexpected core earnings (o, — 0.022), as predicted. A one 
standard deviation increase in special items is expected to result in an increase in unex- 
pected core earnings (scaled by sales) of 25 basis points [0.022 (the coefficient from Table 
6) multiplied by 0.114 (the standard deviation of income-decreasing special items for this 
sample)]. Not surprisingly, as the sample is narrowed down to focus on those firms that 
have greater opportunity to classification shift, the adjusted R? of the models increases 
monotonically. For those firms with special items of at least 5 percent of sales, a one 
standard deviation increase in special items is expected to result in an increase in unex- 
pected core earnings of 63 basis points. 
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TABLE 6 


Regression of Unexpected € Core Earnings and Future Unexpected Change in Core Earnings 
on Special Items as a Percentage of Sales 


Dependent Variable = UE_CE, 
Non-Zero Income- Іпсоте-Ресгеаѕіпр 


Independent Predicted All Compustat Decreasing Special Special Items = 5% 
Variables Sign Firms Items of Sales 
Intercept 0.000 0.003 -0.000 
(0.26) (2.18) (-0.07) 
PSI, + 0.022 0.017 0.022 
(4.61) (3.06) (2.39) 
Adjusted R? 0.03% 0.04% 0.06% 
Dependent Variable = UE_ACE,,, 
Non-Zero Income- | Income-Decreasing 
Independent Predicted All Compustat Decreasing Special Special Items = 5% 
Variables Sign Firms Items of Sales 
Intercept 0.001 0.001 0.010 
(2.45) (5.42) (3.47) 
9651, - -0.010 -0.018 -0.025 
(-2.24) (-3.67) (-3.20) 
Adjusted R? 0.01% 0.05% 0.11% 
Number of 76,901 23,743 8,043 
Observations 
Standard Deviation 0.114 0.193 0.286 
of 96S], 


The sample consists of 76,901 firm-year observations, t-statistics are shown in parentheses. Unexpected Core 
Earnings in year г (ЛЕ СЕ) and Unexpected Change in Core Earnings from year t to t+ 1 (UE. ACE,,,) are the 
differences between reported and predicted Core Earnings and Change in Core Earnings, respectively, where the 
predicted values are calculated using the coefficients from models (1) and (2) (shown below), estimated b y fiscal 
year and industry and excluding firm i: 


СЕ, = В, + B,CE,., + BATO, + B,ACCRUALS, + B,ACCRUALS, + B,ASALES, | 
+ B,NEG_ASALES, + в, me) 


ACE, = фо + 6,CE,.; + ФАСЕ,_, + ф,ААТО, + b,ACCRUALS,_, + ACCRUALS, 
+ Фе АЗАГЕ$, + &4NEG Л ASALES, + v, (2) 


CE, is Core Earnings, calculated as (Sales, — Cost of Goods Sold, — gees General, and Administrative 
Expenses,)/ Sales, where Cost of Goods Sold and Selling, General, and Administrative Expenses exclude 
Depreciation and Amortization, as determined by Compustat. ATO, is the asset turnover ratio, defined as Sales,/ 
((NOA, + NOA,_,)/2), where NOA is Net Operating Assets, and is defined in Table 1, Panel B. ACCRUALS, is 
g Accruals, calculated as [Net Income before Extraordinary Items — Cash From Operations)/ Sales]. 
ASALES, is the percentage change in sales from year 1—1 to г (Sales, — Sales, ,)/ (Sales, ,). NEG. .ASALES, із 
ASALES, if ASALES, is negative, and 0 otherwise. 9651, is income-decreasing special items as reported by 
Compustat (where positive specials item are income-decreasing and income-increasing special items are set to 
zero) scaled by sales, both in year t. 
All variables are winsorized at 1 percent and 99 percent. 
See Table 1, Panel B, for Compustat data item numbers. 
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Referring to the results for Equation (3b), presented in the lower half of Table 6, as 
predicted, special items in year t are negatively associated with the unexpected change in 
core earnings in year t--1 (n, — —0.010). This translates into a reversal of 12 basis points 
(—0.010 х 0.114) in year t+1 for a one standard deviation increase in 9651, for the full 
sample, and of 72 basis points for those firms with special items of at least 5 percent of 
sales. Overall, the results are consistent with managers classifying some core expenses as 
special in the year a special item is recognized. 


Serial Special Items 

In the above analysis for the full sample, a one standard deviation increase in special 
items corresponds to core earnings that are 25 basis points higher than expected, but these 
special items only predict a “reversal” of 12 basis points in year t+1. It is important to 
note, however, that the reversal is expected to be smaller in the presence of interperiod 
shifting (Burgstahler et al. 2002). For example, if the manager shifted $100 of year t 
operating expenses and $25 of year t+1 operating expenses to the special item in year t, 
only $75 of the $100 would “recur” in year t+1. In addition, my tests provide a lower 
bound on the amount of expenses shifted to special items, as my model only picks up 
temporary shifting. Eastman-Kodak, for example, had “nonrecurring” losses in ten ош of 
12 years (Serwer 2002). They could simply classify the same nontransitory expenses as 
special each year. Approximately 16 percent of my sample firms recognize income- 
decreasing special items in both year 1 and year t-- 1. I expect the reversal to be lower when 
there is a special item in year t+ 1, because the manager can simply misclassify the expenses 
again. To provide evidence on how much lower the reversal is in these cases, I estimate 
the following regression: 


UE ACE, = «y + 0,%81, + o, SI IND,,, + HSI, X SLIND,, + €, — (4) 


where SI..IND,,, is an indicator variable that is equal to 1 if the firm has a special item in 
year 1-1, and 0 otherwise. The results in Table 7 show that after controlling for serial 
special items (or the opportunity to classification shift in year 1+1), the actual reversal of 
shifted expenses is higher when firms do not report a special item in year 1+1 (for the full 
sample, o, = —0.026 as compared to т, = —0.010 in Table б). In contrast, when the firm 
has a special item in year t--1, the reversal is zero, on average, for the full sample; œ, 
+ о; is not statistically different from zero. 


Compustat Special Items 

A limitation of the above analyses is the use of Compustat special items, which groups 
many types of special items together. These include items that are not susceptible to clas- 
sification shifting, such as asset write-downs, and items that are more amenable to classi- 
fication: shifting, such as restructuring charges other than asset write-downs or merger- 
related costs. While I use Compustat special items in order to conduct a large sample study, 
I also hand-collect data for a subset of firms. If the results in Table 6 are evidence of 
classification shifting, then I expect the results to be stronger when only the special items 
amenable to misclassification are used. _ | 
"То select my subsample, I identify all firms that were in the S&P 500 from 1996 to 
2000 and had income-decreasing special items, as reported by Compustat, of at least 5 
percent of sales, resulting in 190 firm-years. For this sample of firms, I examine 10-Ks and 
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TABLE 7 


Regression of Future Unexpected Change in Core Earnings on Special Items as a 
Percentage of Sales and Repeat Special Items 


Dependent Variable = UE_ACE,,, 


Non-Zero Income- Income-Decreasing 
Predicted АП Compustat ae ma Special Items = 5% 


Independent Variables Sign Firms of Sales 
Intercept 0.000 0.004 0.007 
| (0.60) (2.53) (1.64) 
9651, - -0.026 -0.045 -0.053 
(-3.58) (—5.23) (—3.64) 
SI .IND,., ? 0.003 0.004 | 0.008 
(2.46) (2.12) (1.31) 
%SI,X SI_IND,,; + 0.022 0.039 0.036 
(2.56) (3.66) (2.08) 
Adjusted R? 0.02% 0.17% 0.28% 
Number of Observations 76,901 23,743 8,043 


The sample consists of 76,901 firm-year observations, t-statistics are shown in parentheses. Unexpected Change 
in Core Earnings from year t to t+1 (UE_ACE,,,) is the difference between reported and predicted Change in 
Core Earnings, where the predicted values are calculated using the coefficients from model (2) (see Table 5), 
estimated by fiscal year and industry and excluding firm i. 9657, is income-decreasing special items as reported 
by Compustat (where positive specials item are income-decreasing and income-increasing special items are set to 
zero) scaled by sales, both in year t. S]. IND,,, is an indicator variable that is equal to 1 if special items in year 
t+1 are non-zero, and 0 otherwise. 

All continuous variables are winsorized at 1 percent and 99 percent. 

See Table 1, Panel B, for additional data definitions and Compustat data item numbers. 


earnings announcements and record the type of transitory charge.!? Next, I form two subsets 
of special items, those that are susceptible to classification shifting and those that are not. 
I consider PP&E write-offs, goodwill write-offs, and losses on asset sales to be unsuscep- 
tible to classification shifting (9651. NOT. SHIFTABLE). All other special items are con- 
sidered to be susceptible to classification shifting (9651. SHIFTABLE). If the PP&E write- 
offs, goodwill write-offs, or losses on asset sales are not clearly broken out from susceptible 
charges, then I classify the entire charge as susceptible. I estimate the following regressions 
and provide the results in Table 8: 


UE. CE, = № + A,96SI. SHIFTABLE, + MSI NOT. SHIFTABLE, + є, (58) 


UE_ACE,,, = фу + 9,%SI_SHIFTABLE, + 9,%81 NOT. SHIFTABLE, + му 
(5b) 


Consistent with classification shifting, А, is positive and significant, while ф, is negative 
and significant (A, = 0.27, o, = —0.13). In economic terms, a one standard deviation 
increase in %SI_SHIFTABLE, is expected to increase unexpected core earnings in year t 
by 243 basis points (0.27 х 0.09) and decrease the unexpected change in core earnings in 
year (41 by 117 basis points (—0.13 х 0.09), where 0.09 is the standard deviation of 


15 Note that in terms of presentation, nearly every firm breaks out the charge оп the income statement and in the 
press release, as noted in prior research (e.g., Kinney and Trezevant 1997; Bowen et al. 2005). 
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TABLES . 
Regression of Unexpected Core Earnings and Future Unexpected Change in Core Earnings 
on Special Items as a Percent of Sales, where Special Items are Classified as *Shiftable" 
or “Not Shiftable" Based on Type 


Independent Variables Predicted Siga ШЕСЕ Predicted Sign | UE ACE,, 


Intercept 0.025 0.024 
| (2.69) | (2.47) 
%SI_.SHIFTABLE, + 0.268 - —0.132 
(4.43) (—2.11) 
%51 .NOT .SHIFTABLE, ? 0.024 ? 0.181 
(0.34) (2.53) 
Adjusted R? - 8.52% 5.10% 
Number of Observations 190 . 190 


The sample consists of 190 firm-year observations from 1996-2000, where the sample firms are in the S&P 500 
in each corresponding year and had income-decreasing Compustat special items of at least 5 percent of sales in 
that year. t-statistics are shown in parentheses. Unexpected Core Earnings in year t (UE..CE,) and Unexpected 
Change in Core Earnings from year t to t+ 1 (UE_ACE,,,) are the differences between reported and predicted 
Core Earnings and Change in Core Earnings, respectively, where the predicted values are calculated using the 
coefficients from models (1) and (2) (see Tables 4 and 5), estimated by fiscal year and industry and excluding 
firm i. 9651. МОТ .SHIFTABLE, represents special items reported by the firm that were either an asset write-off 
(excluding inventory and receivable write-offs) or a loss on the sale of an asset, scaled by sales in the same year. 
All other special items that are reported by the firm are classified as 9551. .SHIFTABLE, and are also scaled by 
sales. 

All variables are winsorized at 1 percent and 99 percent. 

See Table 1, Panel B, for additional data definitions and Compustat data item numbers. 


%SI_SHIFTABLE,. Also consistent with my conclusion of classification shifting, №, is sta- 
tistically insignificant. In other words, only special items that are susceptible to classification 
shifting experience unusually high core earnings in year t. Interestingly, Ф, is positive and 
significant; special items that are not susceptible to classification shifting are associated 
with higher than expected changes in core earnings in the next year, instead of the reversal 
associated with special items tbat are susceptible to classification shifting. This result is 
consistent with real economic improvements as a result of special items, after controlling 
for the effects of classification shifting. Overall, a finer partition of special items into those 
that are susceptible to classification shifting and those that are not improves the power of 
the tests and corroborates the results from the larger Compustat sample. 


Incentives to Meet the Analyst Forecast 

Hypothesis 2 predicts that classification shifting will be more pervasive in periods when 
the firm has greater net benefits to classification shifting; specifically, when the classification 
shifting allows the manager to meet the analyst forecast. To test this hypothesis, I attempt 
to identify firms that would not have met the analyst forecast without classification shifting. 
I expect to find a greater degree of classification shifting by these firms than by the average 
firm. I classify firms that ex post met the analyst forecast by zero or one cent (JUSTMET) 
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as those firms that would not have met the analyst forecast absent classification shifting.’® 
In Table 9, I present tbe results for the following regressions: 


UE CE, = Yo + ү,%$1, + 44JUSTMET, + ү,%51 х JUSTMET, + Е, (6a) 
СЕ ACE,,, = Ө, + 0,9681, + @JUSTMET, + 04,9681, X JUSTMET, + v, (69) 


The results are consistent with managers classification shifting to a greater degree when 
doing so allows them to meet the analyst forecast. À one standard deviation increase in 
special items is expected to increase unexpected core earnings (scaled by ѕғ1еѕ) by an 
additional 31 basis points and have a subsequent reversal of an additional 34 basis points." 





TABLE 9 


Regression of Unexpected Core Earnings and Future Unexpected Change in Core Earnings on 
Special Items as a Percent of Sales and Just Meeting the Analyst Forecast 


Independent Variables UE CE, | UE АСЕ, UE.CE UE ACE,, 

Intercept —0.008 0.002 —0.008 0.002 
(-7.80) (236) (—8.03) (2.55) 

965, 0.015 —0.003 0.014 —0.003 
(1.95) (-0.41) (1.90) (—0.38) 

JUSTMET, . 0.017 —0.001 0.017 —0.001 
(12.77) (71.01) (12.39) (—0.86) 

HIGH MB, 0.003 —0.001 
(1.93) (—0.89) 

%51,х JUSTMET, 0.027 —0.029 —0.000 —0.019 
(2.41) (—2.85) (—0.01) (LT) 

%SI, X JUSTMET, X HIGH MB, 0.148 —0.053 
(6.76) (=2772) 
Adjusted R? 0.48% 0.04% 0.60% 0.06% 
Number of Observations 42,464 42,464 42,464 42,464 


The sample consists of 42,464 firm-year observations, 20,606 firms that just met the analyst forecas-, and 4,800 
high market-to-book firms that just met the analyst forecast. t-statistics are shown in parentheses. Unexpected 
Core Earnings in year t (UE_CE,) and Unexpected Change in Core Earnings from year t to t+1 (UE_ACE,,,) 
are the differences between reported and predicted Core Earnings and Change in Core Earnings, respectively, 
where the predicted values are calculated using the coefficients from models (1) and (2) (see Tables 4 and 5), 
estimated by fiscal year and industry and excluding firm і. 9651, is income-decreasing special items as reported 
by Compustat (where positive specials item are income-decreasing and income-increasing special items are set to 
zero) scaled by sales, both in year t. JUSTMET, is an indicator variable that is equal to 1 if the annual I/B/E/S 
analyst forecast error for year гіз 0 or 1 cent, and 0 otherwise. HIGH MB firms are those in the highest quintile 
with respect to market-to-book ratio in year г. 

All continuous variables are winsorized at 1 percent and 99 percent. 

See Table 1, Panel B, for additional data definitions and Compustat data item numbers. 





16 As a second approach, in results not tabulated, I compare my estimate of classification shifting (e.g., 2.2 percent 
of special items) to the analyst forecast error (adjusting the forecast error to represent pre-tax dollars). If the 
estimate of classification shifting is greater than the forecast error, in pre-tax dollars, then I classify that firm as 
having used classification shifting to meet the analyst forecast. Results under this alternative identification 
procedure also strongly support the notion that managers use classification shifting to meet the analyst forecast, 
especially in high-growth firms 

17 The incremental basis points are calculated by multiplying the coefficients on the interaction terms (0.027 and 
—0.029) by the standard deviation of special items for I/B/E/S firms (0.116). 
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Moreover, this finding is not an artifact of firm performance; if JUSTMET is replaced with 
an indicator variable representing all firms that met the analyst forecast versus those firms 
that "just" met the analyst forecast, then the interaction terms in both regressions are 
insignificant (not tabulated). | 

Some firms have a greater incentive than others to meet the analyst forecast. For ех- 
ample, Skinner and Sloan (2002) find that there is a severe price penalty for growth firms 
that miss the analyst forecast. I examine whether classification shifting is more pervasive 
for growth firms that I expect have used classification shifting to meet the analyst forecast. 
Following Skinner and Sloan (2002), I consider high-growth firms to be those in the highest 
quintile with respect to market-to-book ratio. The final two columns of results in Table 9 
confirm the position that managers of high-growth firms classification shift more than the 
average firm manager in order to meet the analyst forecast. A one standard deviation in- 
crease in special items is expected to incrementally increase unexpected core earnings by 
172 basis points and have a subsequent incremental reversal of 62 basis points for firms 
that are in the top quintile with respect to market-to-book ratio and that just met the analyst 
forecast. These findings provide strong evidence that managers classification shift to meet 
the analyst forecast, especially when they are managers of growth firms. 


Stock Price Implications of Classification Shifting 

One incentive to manage earnings is to temporarily maximize stock price (e.g., Rangan 
1998; Teoh et al. 1998; Bartov and Mohanram 2004). Therefore, it is informative to deter- 
mine whether investors are negatively surprised when the expenses that are shifted from 
core expenses in year / recur as core expenses in year (41. As discussed in footnote 5, 
investors might be able to identify the abnormally high core earnings in year г, but cannot 
distinguish the origin, either a real économic improvement related to the special item or 
classification shifting. It is the recurrence of previously excluded expenses in year t+1 that 
ex post identifies the source of the unexpected core earnings. To investigate whether inves- 
tors are surprised when expenses that were previously excluded from core earnings recur, 
I examine subsequent market-adjusted returns. Specifically, I estimate the following for 
I/B/E/S firms:? 


RET. 1YR,, = Фо + V,UE CE, + ¥,%SI, + V,UE CE, : 0) 
х 9551, + Controls, + v, | 


where RET..1YR,,, is the one-year-ahead market-adjusted, buy-and-hold return, inclusive 
of dividends, beginning four months after the end of fiscal year 1, and continuing for one 
year.’ I control for several known risk factors and anomalies, market value of equity, book- 
to-market ratio, beta, accruals, and momentum. I transform each of the independent vari- 
ables to its scaled decile rank following Bernard and Thomas (1990). Deciles are formed 
monthly to avoid look-ahead bias. To control for cross-sectional correlation in the regression 


18 Because I find the amount of classification shifting increases dramatically to meet the analyst forecast, I focus 
on only those firms followed by I/B/E/S in this analysis to increase the power of my tests. If I consider all 
firms, results lose statistical significance. 

19 І do not require that firms have data for t+1 to be included in this test, as the worst classification shifters may 
simply fail during year ¢+1. In the event of a delisting, the delisting return (which is adjusted for the delisting 
bias documented in Shumway [1997]) is compounded with the abnormal return from the beginning of the 
portfolio formation to the delisting. The proceeds are then reinvested and earn the market return for the remainder 
of the 12-month period. 
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residual, I estimate the regression separately for each of the 14 years in my sample, and 
then construct t-statistics using the 14 sets of coefficient estimates, in a similar vein to 
Fama and MacBeth (1973). 

In results not tabulated, I find that Y, is negative and weakly statistically significant 
(9, = —0.096; t = 1.80). This coefficient can be interpreted as the hedge return from a 
portfolio created to optimize on the information in the variable, after controlling for the 
other variables in the regression (Bernard and Thomas 1990). Thus, the hedge return on 
classification shifting is 9.6 percent. Moreover, these subsequent negative returns are more 
pervasive in the later years of the study, the time period when pro forma earnings became 
more of a focal point for investors (Bradshaw and Sloan 2002). One-year-ahead abnormal 
returns are —19.1 percent (t = 2.01) from 1996 to 2002. Thus, there is some evidence that 
investors do not fully disentangle classification shifting and are negatively surprised when 
previously shifted expenses recur as core expenses.” 


Real Economic Actions as an Alternative Explanation | 

In this paper, evidence of classification shifting is presented by documenting that un- 
expected core earnings is increasing in special items in year t, but that this increase reverses 
in the following year. It is possible, however, that managers elect to cut discretionary 
spending, such as R&D or advertising expenditures, when their performance is extremely 
negative. If this cut is temporary, then the “reversal” that I document might simply be the 
recurrence of these expenses, which were, in fact, not made in year f. Thus, because special 
items and poor performance are correlated, it is possible that I am merely picking up these 
non-opportunistic cuts of discretionary expenditures rather than classification shifting. 

I address this alternative explanation two ways. First, I estimate Equations (3a) and 
(3b) by performance quintile (not tabulated). If my findings merely reflect discretionary 
spending cuts in poorly performing firms, then I would expect to only find results in the 
lowest quintiles of performance. However, evidence of classification shifting is present 
throughout the performance quintiles, mitigating the likelihood that the effect is purely 
performance driven. Second, I control directly for changes in research and development 
expenditures to proxy for discretionary spending. Results are not sensitive to the inclusion 
of this control variable. Thus, it does not appear that discretionary spending cuts are driving 
the relation between core earnings and special items. 


Model Specification as an Alternative Explanation 

The model of core earnings developed in Section IV includes current-year accruals as 
an indicator of extreme performance. This variable is important because special-item firms 
tend to experience extreme negative performance. However, part of the accruals in year г 
may be due to the accrual portion of the year ¢ special item. In the extreme, if accruals are 
entirely special items, then the residual from my model would be orthogonal to special 
items and there would be no bias. In actuality, however, accruals are comprised of both 
“normal” accruals and “special item" accruals. Both of these accruals are expected to be 
associated with performance. However, the association with performance may not be equal 
for both types of accruals, while my model is restricted to treat all types of accruals the 
same. Thus, if special item accruals are more or less associated with performance than 


20 Note that this finding does not conflict with prior research. Burgstahler et al. (2002) find that special items are 
weakly associated with positive future returns right around subsequent earnings announcements. Doyle et al. 
(2003) find that special items are not associated with one-year-ahead abnormal returns. I find that the portion 
of special items I suspect are, in fact, core expenses are negatively associated with future returns, after controlling 
for the main effect of special items, which is weakly positive (Y; = 0.063; t = 1.42). 
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“normal” accruals, then the residuals from my models might vary systematically with 
accrual special items. 

In this section, I re-estimate core earnings and next year’s change in core earnings 
under three alternate specifications, presented in the first three columns of results in Table 
10. The first alternate specification (no accruals) excludes accruals from the models entirely. 
The second specification (accruals = CE — CFO) includes accruals before special items, 


TABLE 10 


Regression of Alternate Core Earnings and Future Change in Core Earnings Metrics on 
Special Items as a Percentage of Sales 


UE_CE, ‘UE_CE, UE_CE, UE _CE, 
Independent Pred. No Accrualsin Accruals = Accruals = Accruals = 
Variables Sign the Model CE – СРО М - СЕО —SI ACC NI — СЕО 
Intercept 0.006 0.004 |». 0.002 0.000 
(11.04) (7.54) (2.83) (0.26) 
% SI, + —0.174 —0.069 —0.032 0.022 
(—36.43) (—16.88) (—6.94) (4.61) 
Adjusted R? 1.7096 0.37% 0.06% 0.03% 
UE_ACE,,, UE_ACE,,, UE АСЕ,,; UE_ACE,,, 
Independent Pred. No Accrualsin | Accruals = Accruals = Accruals = 
Variables Sign the Model CE — CFO NI - CFO — SI-ACC NI — CFO 
Intercept —0.000 —0.000 0.001 0.001 
(-0.34) (-0.64) (0.94) (2.45) 
% SI, - 0.060 0.052 0.010 —0.010 
(13.49) (12.96) (2.45) (-2.24) 
Adjusted R? 0.23% 0.22% 0.01% 0.01% 


The sample consists of 76,901 firm-year observations, t-statistics are shown in parentheses. Unexpected Core 
Earnings in year t (UE_CE,) and Unexpected Change in Core Earnings from year t to 1+ 1 (UE_ACE,,,) are the 
differences between reported and predicted Core Farnings and Change in Core Earnings, respectively, where the 
predicted values are calculated using the coefficients from models (1) and (2) (shown below) estimated by fiscal 
year and industry and excluding firm i: 


CE, = Во + B,CE, , + B, АТО, + BACC. VAR, , + В,АСС. VAR, + B,ASALES, 
+ В МЕС. .ASALES, + e, (1) 


ACE, = фу + 6,CE,_, + ACE, + ф,ААТО, + Ф,АСС. VAR, + 6,ACC.. VAR, 
+ &4ASALES, + &9NEG .ASALES, + v, (2) 


where ACC_ VAR ів equal to the following: 

No Accruals in the Model: ACC. VAR is set equal to zero, 

Accruals = CE — CFO: АСС_УАК is calculated as [(Core Earnings ~ Cash From Operations) / 
Sales], 

ACC_VAR is calculated as [(Net Income before Extraordinary Items 

— Cash From Operations + Accrual Special Items*)/Sales], 

ACC VAR is calculated as [(Net Income before Extraordinary Items 

— Cash From Operations)/Sales]. 


Accruals = NI — CFO + 51 ACC: 
Accruals = NI — CFO: 


«ІТ roughly estimate accrual special items to be the sum of Compustat data items #213 (Loss [Gain] on sale of 
assets) and #217 (Funds from Operations, Other [which includes Reorganization Costs and Special Items]). 
This value is winsorized at net special items (Compustat data item #17) and set to 0 if net special items are 
income-increasing. 

All variables are winsorized at 1 percent and 99 percent. 

See Table 1, Panel B, for additional data definitions and Compustat data item numbers. 
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where accruals are equal to the difference between core earnings (CE) and cash from 
operations (CFO). Essentially, this treats all special items as accruals. The third specification 
(accruals = NI — CFO — SI_ACC) attempts to back out only those special items that are 
expected to be accruals, rather than all special items; accruals are equal to net income 
before extraordinary items (NJ) less cash from operations (CFO), and accrual special items 
(SI_ACC).”! Finally, for comparison purposes, the fourth column of results (accruals = NI 
— CFO) is the complete model. 

I regress these three sets of alternate dependent variables on special items. Results for 
the full sample are no longer consistent with classification shifting. Rather, the performance 
effect appears to swamp any evidence of shifting; in each alternate setting, special items 
and unexpected core earnings are negatively associated, and special items and unexpected 
change in next year’s core earnings are positively associated (consistent with mean rever- 
sion). To pursue the explanation that the model is weak as a result of inadequately con- 
trolling for performance, I re-estimate the main regression and include an indicator variable 
for non-negative core earnings (as these firms should be less subject to the strong perform- 
ance effect). As supporting evidence, in each of the three alternative specifications I find a 
positive and significant coefficient in the initial amount of shifting and a negative and 
significant coefficient on the subsequent reversal (not tabulated). For example, using the 
model that excludes accruals entirely, o, = —0.152 + a, = 0.193, for a net shifting of 
0.041 for firms with non-negative core earnings. 

Referring to Table 11, results continue to support classification shifting when I focus 
on those special items that are amendable to shifting. In the first column, which does not 
have accruals in the model, special items that are amenable to classification shifting (“‘shift- 
able") are weakly positively associated with contemporaneous earnings (a, = 0.092; 

= 1.44; one-tailed p-value of 0.076) and there is a strong negative "reversal" in the 
subsequent year (y, = —0.263; t = —3.51). In results not tabulated, if I examine only those 
firms with non-negative core earnings, then the results strengthen considerably (a, = 0.123; 
t = 2.05; y, = —0.259; t = —3.71), again supporting the notion that accruals are a per- 
formance control. Overall, the results in Table 11 indicate evidence of classification shifting, 
regardless of model used. 

Results also support the existence of classification shifting when I examine those firms 
that just met the analyst forecast, especially for high market-to-book ratio firms (not tab- 
ulated). The interaction terms on %SI x JUSTMET and %SI x JUSTMET x HIGH MB 
are significantly positively associated with contemporaneous unexpected core earnings and 
significantly negatively associated with the unexpected change in core earnings in year 
t-- 1.2 In sum, the models that do not include total accruals appear to do an inadequate job 
of controlling for performance, although some evidence of classification shifting remains. 
As discussed in Section I, since controlling for performance is a necessity of my model, I 
subject the results to additional subtests to corroborate my findings. However, at the end 
of the day, the reliance on an imperfect model is a limitation of my study. 


21 Т roughly estimate accrual special items to be the sum of Compustat data items #213 (Loss [Gain] on sale of 
assets) and #217 (Funds from Operations, Other [which includes Reorganization Costs and Special Items]). This 
value is winsorized at net special items (Compustat data item $17) and set to zero if net special items are 
income-increasing. 

22 The total amount of initial shifting often remains negative in both settings. For example, when accruals are 
excluded from the model entirely, y, = —0.223 + y, = 0.048 for a net shifting of —0.175, as the main effect 
continues to be swamped by performance. 
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VI. CONCLUSION 

This paper examines the classification of items within the income statement as an 
earnings management tool. Unlike accrual management or the manipulation of real activi- 
ties, classification shifting does not change bottom-line earnings and, thus, does not reverse 
in future periods or invite the same level of scrutiny by auditors and regulators. However, 
individual line items have different information content for future earnings and, correspond- 
ingly, for investors. 

I examine classification shifting between core expenses (cost of goods sold and selling, 
general, and administrative expenses) and special items. Using a model of core earnings, 
analogous to that of the accrual model, I find that unexpected core earnings are increasing 
in special items in the year of the special item, and this unexpectedly high performance 
reverses in the following year. Furthermore, I find this shifting to be more pervasive when 
it allows the manager to meet the analyst forecast, as special items tend to be excluded 
from this earnings benchmark. 

Overall, the evidence of classification shifting is compelling: (1) unexpected core earn- 
ings are increasing with special items in year /, but this improvement reverses in the fol- 
lowing period; (2) the unexpected improvement only reverses if there are no special items 
present in year (4-1, otherwise managers appear to classification shift again in year 1-1, 
thereby maintaining the inflated core earnings; (3) these results hold only for those special 
items that, upon inspection, appear to be amenable to classification shifting; (4) these results 
are stronger for firms that just met the analyst forecast, and stronger yet for growth firms 
that just met the analyst forecast; and (5) there is some evidence that classification shifting 
is associated with negative returns in the subsequent year, suggesting that investors are 
negatively surprised when expenses that were previously excluded from core earnings recur. 

There are many possible avenues for future research. First, the current paper relies on 
an imperfect model of core earnings; while this model is a first step to documenting clas- 
sification shifting, improvements of this model might be examined in future studies. Future 
research might also further explore the negative abnormal returns documented herein by 
perhaps focusing on incentives to shift or by examining whether these returns vary cross- 
sectionally, for example with the sophistication of investors. 

It is also possible that managers classification shift to a greater degree preceding equity 
issuances (Rangan 1998; Teoh et al. 1998) or when the valuation multiple on earnings is 
higher-—such as for profit versus loss firms (Hayn 1995) or for those firms with high P/E 
ratios. In addition to capital market incentives, managers might wish to meet the analyst 
forecast in order to maximize their cash bonus (Matsunaga and Park 2001), sell their shares 
(Richardson et al. 2004; McVay et al. 2006), or maximize their compensation in general, 
as compensation committees tend to shield managers from income-decreasing transitory 
charges (Dechow et al. 1994; Gaver and Gaver 1998). These settings are left for future 
research. i 

The classification shifting documented in this paper is part of a much larger phenom- 
enon of earnings management using income-statement classification (e.g., Barnea et al. 
1976; Givoly et al. 1999; Davis 2002). There are many additional settings of classification 
shifting that future research might investigate. For example, managers might shift core 
expenses to expense classifications other than special items, such as R&D, which is valued 
differently than other core operating expenses (e.g., Lev and Sougiannis 1996; Aboody and 
Lev 2000), or non-operating expenses and discontinued operations, which are also excluded 


The Accounting Review, May 2006 


Earnings Management Using Classification Shifting 529 


from analysts’ forecasts (e.g., Abarbanell and Lehavy 2002).? Managers might also shift 
expenses across segments to hide abnormal profits (e.g., Berger and Hann 2003; Botosan 
and Stanford 2005) or allay investor concerns about certain segments. Each of these actions 
can mislead investors about the performance and proper valuation of a company. 
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ABSTRACT: Using 1997-1989 annual performance evaluation data of 214 postal 
stores in Korea, we find that financlal performance improves following the implemen- 
tation of an incentive plan that Includes relative performance measures, and that under 
this Incentive plan, the degree of common uncertainty Is positively associated with 
store profitability. We also find evidence that the Incentive effect of the plan Is mitigated 
In stores at which the employees’ perceived unfairness is likely to be high. Finally, we 
find that the net benefits of Introducing the incentive plan may be conditional on the 
degree of common uncertainty, which appears to be Inversely related to the perceived 
unfaimess attributable to the introduction of the contract with relative performance 
measures. 


Keywords: relative performance measures; incentive plan; perceived unfaimess; degree 
of common uncertainty. 
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I. INTRODUCTION 
elative performance evaluation (RPE) entails evaluating individual or заана 
unit performance relative to the performance of others. Justification for RPE can be 
found in economics and behavioral theories. From an economics perspective, RPE 
provides benefits by deriving information about agents’ efforts when the agents face com- 
mon uncertainty, such as common market conditions. RPE then allows a principal to es- 
tablish contracts that are Pareto superior to those based only on an individual agent’s output 
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(Holmstrom 1982). Behavioral theories highlight intrinsic (nonmonetary) influences on ef- 
fort. Although RPE offers potential benefits, it may motivate reduced effort when workers 
perceive competition with co-workers as unfair (Frederickson 1992). For example, a worker 
may perceive competition with co-workers as unfair if co-workers have much greater ability 
or if the working environments present different challenges. RPE may also motivate un- 
desirable behavior, such as sabotaging co-workers, colluding with co-workers, and shirking 
(Gibbons and Murphy 1990). 

Our study uses data from the entire population of 214 post offices (hereafter, stores) 
managed by a postal service provider in Korea, which added relative performance measures! 
to its incentive plan in 1998. We examine whether (1) including relative performance mea- 
sures in the incentive plan is positively associated with financial performance at the store 
level (the incentive effect), (2) the degree of common uncertainty is positively associated 
with store financial performance, (3) perceived unfairness due to introducing the incentive 
plan mitigates its performance impact, and (4) the degree of common uncertainty is nega- 
tively associated with likely perceived unfairness. 

Our research site provides us with a powerful field-based setting to test the theory of 
RPE incentives. First, regulation precludes the monopolistic postal service provider from 
discontinuing operation in unprofitable areas, resulting in greater cross-sectional variation 
(as compared to unregulated industries) in terms of exogenous store characteristics that may 
influence store performance.? Second, to the extent that performance is largely driven by 
“uncontrollable” exogenous factors, we expect economic benefits from RPE because it 
helps insulate agents’ compensation from uncontrollable “common” uncertainty. 

Our results indicate that introduction of the research site’s incentive plan that introduced 
new relative performance measures is positively associated with store-level financial per- 
formance and that under the incentive plan, a store’s degree of common uncertainty is 
positively associated with profitability. Furthermore,- we find evidence that the incentive 
plan’s impact on financial performance is lower in stores with likely high perceived un- 
fairness. Finally, we find some evidence that a store's degree of common uncertainty is 
negatively associated with the probability that the store has a high level of perceived 
unfairness. 

This study has the following implications. First, it suggests that higher levels of com- 
mon uncertainty with an RPE incentive plan can induce agents to exert more effort, leading 
to better financial performance. Second, prior empirical literature has been silent on the 
determinants of behavioral responses caused by an RPE incentive plan. Our study suggests 
that the degree of common uncertainty within a reference group can play a critical role in 
determining agents' perceived unfairness. A high degree of common uncertainty within 
each reference group appears to decrease the costs due to, for example, perceived unfairness 
of benchmarked targets among employees. 

Our study contributes to the RPE literature by providing empirical evidence on RPE's 
performance impact when agents face varying degrees of common uncertainty. Furthermore, 
by examining whether RPE's incentive effect is moderated in stores where employees per- 
ceive unfairness, this study contributes to a growing literature that investigates the effect of 
compensation policies on performance and dysfunctional responses to compensation 


! We use “relative performance measures” to refer to measures that are used in comparing one agent or business 
unit to another, and "relative performance evaluation" to refer to a scheme that bases a person's compensation 
or nonmonetary performance evaluation at least in part on relative performance measures. 

2 A “cream-skimming”’ strategy, often observed in unregulated industries, will likely lead to a concentration of 
stores in profitable areas, resulting in less cross-sectional variation in terms of stores’ external business 
environment, 
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schemes. Our study also extends the literature on the importance of the degree of common 
uncertainty (Frederickson 1992) in determining the effectiveness of RPE contracts. In ad- 
dition, this study contributes to the budgeting literature by providing some of the first 
empirical evidence on motivational effects of benchmarked accounting performance stan- 
dards (Hansen et al. 2003). Finally, our study contributes to the growing literature that 
emphasizes the important role performance measurement systems can play in improving 
the efficiency of government operations. For example, Ittner and Larcker (1998a) suggest 
that the most fundamental question in governmental performance measurement practice is 
whether private sector notions of performance measurement and accountability are appli- 
cable in the public sector. 

The remainder of the paper is organized as follows. We review prior literature in Section 
П and describe our research site in Section Ш. Section IV develops our hypotheses. Sec- 
tion V describes our sample and variables, and presents descriptive statistics. Section VI 
presents our estimation models and results. Concluding remarks are offered in Section VIL. 


IL PRIOR LITERATURE 

Due to data availability constraints, the extant archival empirical literature based on 
agency-theoretic predictions about RPE has largely focused on investigating whether top 
executives are compensated as if their performance is evaluated relative to competitors’ 
performance and provides mixed results (Antle and Smith 1986; Gibbons and Murphy 1990; 
Janakiraman et al. 1992; Sloan 1993; Aggarwal and Samwick 1999; Anderson et al. 2001). 
For example, using data from 1947 to 1977 for 39 firms, Antle and Smith (1986) find weak 
evidence that the executives’ compensation falls as other firms perform better, holding 
their own performance fixed. Using a comprehensive survey, Gibbons and Murphy (1990) 
find that executives are penalized when a competitor group performs better. However, 
Janakiraman et al. (1992) find little empirical evidence that the market and industry com- 
ponents of firm performance are completely removed in determining CEO compensation. 

Research on the performance consequences of RPE or the relationship between RPE 
use and common uncertainty includes Casas-Arce and Martínez-Jerez (2005), who provide 
theoretical and empirical results on retailers compensated via dynamic tournaments, a form 
of RPE. Frederickson (1992) provides experimental evidence that agents’ effort levels are 
higher with an RPE contract than with a profit-sharing contract where compensation is 
solely tied to agents’ absolute performance. He also finds that subjects’ effort levels in- 
creased significantly as the degree of common uncertainty increased with the RPE contract 
but not with the non-RPE contract. Using CEO compensation data, Kren (2002) finds that 
the degree of emphasis on RPE is positively related to the level of common uncertainty. 

Combined, little empirical evidence is available on the performance consequences of 
RPE incentive plans when there is an important source of common uncertainty, especially 
at non-executive levels.? Moreover, to our knowledge, no study has empirically documented 
how dysfunctional responses offset an RPE contract’s incentive effect. In addition, in their 
survey on budgeting practices, Hansen et al. (2003, 106) point out that even though relative 


3 Several studies that did not focus on RPE have found substantial incentive effects at non-executive levels. Banker 
et al. (1996) report that the implementation of a performance-based compensation plan increased sales and that 
the effect persisted and grew over time. Lazear (2000) finds that automobile worker output increased after the 

- company switched its compensation scheme from hourly wages to piece rates. Banker et al. (2000) find that 
firm performance improved after customer satisfaction was included as a performance measure for managers. 
Sprinkle (2000) provides experimental evidence regarding how incentive-based compensation contracts compare 
to flat-wage compensation contracts in motivating individual performance. He shows that incentives not only 
increase the duration of effort, but also apparently increase the intensity of effort. 
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performance measures as performance standards provide “an increased degree of legiti- 
macy” due to the inherent credibility of performance achieved by similar units, there has 
been no accounting research that investigates the motivational impacts of benchmarked 
standards. 

Our study fills these gaps in the extant RPE literature by using archival data to provide 
evidence on the performance impact of RPE given common uncertainty and by examining 
whether behavioral responses such as employees’ perceived unfairness can mitigate the 
incentive effect. Our study also explicitly explores the degree of common uncertainty as a 
key factor influencing the net benefits of introducing relative performance measures into a 
contract. Recent research on RPE in executive compensation settings documents that pre- 
vious mixed findings may be due to inappropriate choice of a peer group facing similar 
uncertainty (Albuquerque 2005). Our study supports the view that use of an appropriate 
peer group can significantly influence the motivational effectiveness of an RPE incentive 
plan. 


Ш. RESEARCH SITE 
Description of Research Site 

The research site for this study is a postal service provider in Korea (hereafter, POST). 
POST is the state-owned organization that provides comprehensive postal services through- 
out Korea. POST is chartered as a monopolistic service provider to handle and deliver mail. 
However, it competes against several large domestic couriers and international couriers, 
such as Federal Express and DHL, in its expedited delivery market, which is more profitable 
than regular mail service. Intense competition in the expedited delivery market significantly 
undermines POST’s profitability. Furthermore, rapid diffusion of electronic communication 
such as email and mobile phone service poses a serious threat to POST’s core business. A 
universal service obligation imposed on POST precludes POST from using its discretion to 
close unprofitable stores, such as those located on remote islands. These factors, coupled 
with stagnant mail volume and decreased profit margins caused by strict regulation of postal 
service rates, contributed collectively to troubled financial performance. 

POST's stores are homogeneous in many important operational aspects, such as orga- 
nizational structure, mail handling and delivery process, incentive system, and basic infra- 
structure. However, stores vary significantly in terms of exogenous factors such as geo- 
graphical location, clientele, and competition. Considering that mail volumes are largely 
determined by exogenous factors, it is highly likely that a store’s profitability is conditional 
on such “uncontrollable” exogenous factors.* 


Previous Incentive Plan 


Even before it introduced the new RPE incentive plan in 1998, POST’s incentive plan 
tied each employee’s incentive pay to the performance of the store where the employee 
worked, and not to individual performance. All store-level front-line employees were cov- 
ered by this incentive plan, as stipulated by the Special Act on the Postal Business 
Operation. 


4 


We regressed each store’s mail volume handled per employee on the following major exogenous factors: pop- 
ulation, service area, average delivery distance, number of financial institutions (a proxy for income level), and 
location of store (Jarge/middle/small city, and rural area) using 1997 data (prior to the new plan). The adjusted 
R? is 77 percent, suggesting that a substantial proportion of variation in mail volume can be attributed to 
exogenous factors over which a store manager cannot exercise conuol. Nevertheless, a store manager can influ- 
ence costs and revenues. 
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Each store’s performance score was a weighted average of multiple measures. One of 
the key performance measures for evaluating each store was its profitability growth rate 
relative to its previous-year profit, where store profitability was defined as store revenue 
divided by store operating cost. Profitability growth rate was selected because store prof- 
itability is influenced by “uncontrollable” exogenous factors and POST is obligated to 
provide universal service throughout the country. Therefore, absolute performance does not 
accurately reflect managerial effort, that is, “controllable” performance. Similarly, produc- 
tivity growth rate, where store productivity was defined as mail volume handled per em- 
ployee, was another key performance measure for each store. 

At the end of each year, each store was ranked based on its weighted average evaluation 
score. Incentive pay was provided only for stores whose relative rank fell within the top 
50 percent of all stores. Employees received the incentive bonus, if any, in April of the 
following year. If a store qualified for a bonus, then each employee in the store received 
the same percent of their monthly salary. Bonuses depended on the store’s rank and ranged 
between 50—150 percent of monthly salary, which suggests that the maximum annual bonus 
constituted about 12.5 percent of annual compensation. Management concluded that the 
range of incentives provided had great potential to motivate workers, especially because 
performance-contingent bonuses were unusual among Korean state-owned companies. 

By including the profitability growth rate relative to the previous year as a key per- 
formance measure, this incentive plan partially resolved the problem of comparing absolute 
performance of stores whose business environments are quite diverse. However, it ignored 
the relative efficiency among stores and accordingly did not effectively motivate stores to 
strive toward achieving greater efficiency. The following comments by a senior manager 
exemplify the motivational problems, which are similar to those that arise when a reward 
system ratchets performance standards, inducing employees to reduce effort and expected 
output in one period to avoid a higher standard in the next period (Indjejikian and Nanda 
1999): 


Before we introduced the new incentive plan, individual stores opportunistically ma- 
nipulated the growth rate in profitability ... Stores chose to exert effort on enhancing 
their profitability every two years. By doing so, in one year, they were able to maximize 
the growth rate in profitability relative to the previous year and get incentive pay. The 
next year they just didn't do anything ... Since all of us know that profit growth cannot 
be sustained eternally, basing incentive pay on profitability growth has an intrinsic 
problem." 


New Incentive Plan with Relative Performance Measures 


Tn an attempt to resolve the motivational problems, POST introduced its new incentive 
plan for employees at each store in January 1998, with the objective of rewarding stores 
for their performance relative to an appropriate reference group. Using cluster analysis, 
the new incentive plan classified all stores into nine reference groups, each of which shares 


5 То illustrate, а store’s operating profit might be 100, 150, 100, and 150 in years 1990, 1991, 1992, and 1993, 
respectively. This pattern of profits is consistent with the following explanation. In 1991, the agents exerted high 
effort to maximize profit growth in order to get a bonus. In 1992, effort-averse agents did not exert any effort 
above the minimally required effort because they knew there was little room to improve the performance that 
year. In 1993, they exerted high effort again to maximize profit growth. This opportunistic behavior is attributable 
to the incentive plan that measures current year's performance relative only to the previous year's. 

5 The reference groups remained fixed until November 1999, when the groups were revised. 
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a similar business environment.’ Both relative profitability (store profitability divided by 
the average profitability in the reference group, where store profitability is defined as store 
revenues divided by store operating cost) and relative productivity (mail volume handled 
per store employee, divided by average productivity in the reference group) are key per- 
formance measures on which the largest weights are placed in determining overall perform- 
ance tied to incentive pay. However, bonus eligibility remains the same as under the old 
plan. That is, incentive pay is provided only to the stores whose weighted averages are in 
the top 50 percent of all stores. 

Table 1 summarizes the key similarities and differences between the new and old in- 
centive plans. Both plans incorporate a tournament-type feature in which bonuses are paid 
only if stores rank in the top 50 percent on the weighted average evaluation score. However, 
the new incentive plan develops performance scores that specifically incorporate relative 
performance measures, that is, performance measures relative to stores with similar business 
environments. The new plan places 48 percent of the relative weight on a store’s profitability 
and productivity measures relative to its reference group.® The relative weight placed on 
store growth rates over the previous year was reduced to 12 percent, compared to 60 percent 
under the old plan. 

Figure 1 diagrams POST’s schedule of introducing the new incentive plan. In December 
1997, the framework was circulated to top management. Stores received details of the plan 
in January 1998. Therefore, in testing the impact of the new incentive plan, January 1998 
is considered as the start of the incentive plan. Performance evaluation of stores in calendar 
year 1997 under the new plan was performed in March 1998 and the resulting bonuses 
were paid in April 1998. 


IV. HYPOTHESES 
H1 and H2: RPE Incentives and Degree of Common Uncertainty 


To summarize our context, both the new and the old incentive schemes involve a 
tournament structure in which bonuses are awarded only to employees at stores ranked in 
the top 50 percent, based on the designated performance measure. The new scheme clas- 
sifies the stores into nine reference groups, each of which shares a similar business’ envi- 
ronment. The new incentive plan additionally incorporates relative performance measures 
within each reference group in developing the performance scores used in the tournament 
ranking. 

To develop our hypotheses, we begin by drawing on agency theory. Agency theory 
describes benefits associated with evaluating risk-averse agents on their relative perform- 
ances when the agents’ performances are affected by a common “shock” term (Holmstrom 
1982). These benefits arise when information from a particular agent’s output is supple- 
mented with relative performance measures that the principal can use to reduce the agent’s 
exposure to individual output uncertainty. Reducing the agent’s compensation risk motivates 
the agent to exert higher effort, relative to the contract without the relative performance 


7 Group 1 contains stores in Seoul, which has approximately 11 million people. Groups 2 and 3 contain stores 
in suburbs of Seoul and the next three largest cities (2—4 million people in each city). Groups 4 through 6 
contain stores in smaller, mid-sized cities, and Groups 7 through 9 contain stores in rural areas. The number of 
stores included in each reference group varies from 16 to 33. The arithmetic mean of profitability measures of 
each reference group varies from 0.4412 to 1.5754 (year 1997). Variables used in the cluster analysis include 
key factors that are expected to influence store profitability and productivity. Examples include population, square 
kilometers of the area covered by the store, and number of employees. Those variables are also used as control 
variables in our regression models in Section VI. 

5 In June 2000, the company allowed stores to request to change reference groups, especially if the stores reported 
a significant level of employees’ perceived unfairness due to perceived unettainable performance standards. 
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measures. Because POST’s new incentive plan uses relative performance measures that the 
old plan did not, the new plan is expected to induce employees to provide greater effort, 
and hence expected improvement in financial performance. 

The agency theory arguments above rest on the use of relative performance measures 
to reduce risk imposed on agents, thereby inducing higher effort. This perspective leads to 
a conclusion that there is no intrinsic value to inducing competition by linking each agent’s 
reward to other agents’ performance. However, behavioral theories argue that RPE can 
increase effort because agents interpret evaluation based on relative performance measures 
as a message that competing is appropriate behavior, again leading to higher effort and 
expected improvement in financial performance (Seta 1982; Frederickson 1992). 

A further issue that arises in our context is the concern about the ratchet effect described 
in our earlier anecdote, which reports varying effort from year to year in order to increase 
the likelihood of receiving bonuses. A “reasonably objective” way to set targets or stan- 
dards to encourage effart in multiperiod contexts that are susceptible to a ratchet effect is 
to use RPE (Milgrom and Roberts 1992, 233), as POST did by placing a relatively large 
weight on relative performance measures in determining which stores would receive bo- 
nuses. Thus, the economic and behavioral arguments all point to increased effort and ex- 
pected improved financial performance with the new incentive plan, leading to Hypothe- 
sis 1. 


НІ: Ceteris paribus, introduction of the new RPE incentive plan is positively associated 
with financial performance. 


To develop Hypothesis 2, we first note from agency theory that if the agents' output 
measures are independent (that is, there is no common uncertainty), the principal does not 
gain from relative performance compensation schemes because no additional information 
is gained by comparing agents' outputs, precluding a risk-reducing and effort-increasing 
change in the contract (Holmstrom 1982). Conversely, Mookherjee (1984) develops con- 
ditions in a two-agent principal-agent model under which the principal's expected utility 
varies continuously with the correlation between the two agents' output measures. If the 
correlation is close to 1 (that is, there is maximal common uncertainty), then the principal 
can almost achieve the first-best welfare. Consistent with this theory, we expect that in- 
creases in the degree of common uncertainty are associated with increases in agents' efforts 
and, consequently, financial performance, 

We next draw on referent cognitions theory, which states that a person's “negative 
reactions to a resource allocation decision occur when two conditions are met: (1) the 
outcomes associated with the decision are considerably lower than easily imagined alter- 
native outcomes and (2) the procedures that give rise to outcomes are unfair, thereby ren- 
dering the outcomes unjustified" (Brockner and Wiesenfeld 1996, 193). In a budgeting or 
performance evaluation context, the first condition (“‘distributive justice") is related to fair 
budgets, standards, or targets, and the second condition (“procedural justice”) is related to 
a fair budgeting or target-setting process. Based on referent cognitions theory, Libby (2001) 
predicts and finds in a budgeting setting that subjects’ performance was lowest in the 
condition with an unfair target arising from an unfair process. We argue that in our situation, 
conversely, increases in the degree of common uncertainty increase fairness through both 
procedural and distributive justice. We therefore expect increases in the degree of common 
uncertainty to be associated with higher agent effort and, therefore, improved financial 
performance. Based on these behavioral arguments and our economic arguments, we predict 
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that increases in the degree of common uncertainty are associated with increases in financial 
performance, as stated in Hypothesis 2 below: 


H2: With the new RPE contract, ceteris paribus, increases in the degree of common 
uncertainty are positively associated with financial performance. 


H3 and H4: RPE, Perceived Unfairness, and Degree of Common Uncertainty 

Although economic theory shows that RPE can provide benefits when there is common 
uncertainty, several factors rest outside the theory. Most pertinent to our study and consistent 
with referent cognitions theory, RPE may induce reduced effort that, in turn, causes lower 
expected financial performance when workers perceive that the process of setting the rel- 
ative performance targets and the targets themselves are unfair.? Drawing on referent cog- 
nitions theory, several studies on budgeting provide evidence that unfair budget targets are 
associated with lower performance when employees perceive the budgeting process is unfair 
(Magner et al. 1995; Libby 2001). Under the new incentive plan, stores in POST are 
assigned the average performance within a reference group as their budget target (that is, 
the threshold to achieve a bonus). Although this plan was intended to create fair targets, 
some stores viewed their targets as unfair due to a process with low procedural justice (for 
example, there was no store employee participation in the assignment of stores to reference 
groups). This reasoning suggests that performance will be lower in stores that perceive 
unfairness in the process and targets associated with the new incentive plan. 

We apply the arguments above to develop Hypothesis 3 below. That is, employees' 
perceived unfairness is expected to be associated with lower financial performance. This 
prediction is also consistent with the findings that performance decreases at extremely 
difficult budget levels due to discouragement and less commitment to achieve the target 
(Merchant and Manzoni 1989; Fisher et al. 2003). 


H3: The new RPE incentive plan's impact on performance is mitigated (lower) in stores 
at which the perceived unfairness is likely to be high, relative to other stores. 


Referent cognitions theory stresses the importance of procedural justice in addition to 
distributive justice in relation to feelings of unfair treatment (Cropanzano and Folger 1989; 
McFarlin and Sweeney 1992). In our situation, all the stores are assigned into one of nine 
reference groups, and the perceived fairness of the evaluation procedure is likely to vary 
according to the extent to which a store is sorted into a.more or less homogeneous reference 
group. Some stores reported major problems with dysfunctional responses due to decreased 
morale or skepticism, resulting from the perception that assignment to a particular reference 
group created unfair and unattainable standards for achieving a bonus. The perception of 
unfair assignment to a group and associated unattainable standards suggests that there was 
a low degree of common uncertainty within the reference group. Employees of stores 
assigned to a reference group with high common uncertainty are expected to view the 
process as more fair, in which case referent cognitions theory predicts less resentment and 
perceptions of unfairness, regardless of whether the target is ultimately achievable. We 


? RPE contracts may also generate incentives to sabotage the measured performance of coworkers, to collude with 
coworkers and shirk, and to apply for jobs with inept coworkers (Gibbons and Murphy 1990). However, these 
dysfunctional responses are not very likely to occur in POST. Except for reference group 1 (stores in Seoul), 
each reference group comprises geographically dispersed stores. It is therefore not likely that a store manager 
could sabotage the measured performance of other stores in the same reference group or collude with them. 
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therefore predict that the degree of common uncertainty is negatively correlated with the 
extent to which employees perceive unfairness, as stated in Hypothesis 4: 


H4: The degree of common uncertainty is negatively associated with likely perceived 
unfairness. 


V. METHODS 
Sample 

Annual performance evaluation data were obtained for 1997 through 1999 for POST’s 
214 stores.!° Financial and operational data for individual stores were collected from 
POST's performance measurement system. Control variables such as demographics and key 
statistics for individual stores were collected from the company’s database that tracks in- 
formation on individual stores’ competitive environments. In addition, company documents 
and interviews with store managers provided qualitative data on their expectations and 
comments about the incentive plan. 


Dependent Variables 


For H1 through H3, we use PROFRATE, defined as revenue divided by operating cost 
of a store, as our dependent variable. We use this measure for the following reasons. First, 
this measure is actually used in the new incentive plan as a profitability measure. Second, 
by using PROFRATE, we are able to reduce the likelihood of heteroscedasticity that might 
occur when using alternative financial measures such as revenue or operating profit.!! Third, 
by using this measure we can capture the effect of agents’ efforts on both revenue increase 
and cost reduction.'? 

Discussion with POST’s management revealed that stores initially responded to the new 
incentive plan by reducing operating cost rather than trying to increase revenue because 
mail volume, а store's major revenue driver, is largely determined by “uncontrollable” 
exogenous factors.'? Furthermore, stores reportedly exerted effort to increase other revenue 
sources such as railway or airline ticketing services or e-post (an online shopping mall that 
sells specialty products peculiar to that region). Therefore, PROFRATE should effectively 
capture the plan’s impact on efforts for reducing cost and increasing revenue at the store 


10 We were allowed to access the performance evaluation data for 2000. However, mergers among stores and 
changes in performance measures that occurred in 2000 preclude us from using the data for our analysis. 
Furthermore, in 2000, POST expanded the coverage of bonus eligibility to 80 percent of the stores from the 
previous 50 percent of the stores, for two reasons. First, a significant proportion of stores (47 stores, or 22 
percent of the population) did not receive an incentive bonus for three consecutive years after the new incentive 
plan’s introduction. In response to soaring complaints from these stores and management’s concern about dys- 
functional responses due to perceived unfairness of the benchmarked targets, POST decided to expand the bonus 
coverage. Second, although the new incentive plan significantly contributed to improving POST’s financial 
performance, some critics argued that increased competition and too much emphasis on “profitable” mail service 
induced by the new plan compromised the fulfillment of an essential public duty (i.e., a service obligation that 
guarantees customers a universal standard of postal service at a single price regardless of where they live). To 
at least partially mitigate this concern, POST decided to expand the coverage. 

п Heteroscedasticity is a common problem encountered when the estimation is based on cross-sectional data. One 
way to reduce this problem is to use a deflated dependent variable. For example, Banker et al. (2000) use 
normalized revenue and cost deflated by the number of available rooms as a proxy for the size of a hotel. In 
the “Robustness Analysis" subsection, we also report similar results using operating profit deflated by square 
feet of a store as a dependent variable (Banker et al. 1996). 

2 Postal rates did not change during 1997-1999 and we adjust cost data in 1998 and 1999 for inflation using the 
consumer price index (CPI) computed by the Bank of Korea. 

13 Examples of cost reduction efforts include eliminating unneeded employees, substituting part-timers for regular 
workers, encouraging early retirement of senior employees, and trimming unnecessary costs. 
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level. We also examine the impact of the new incentive plan on productivity by using 
MVOL, which is defined as the natural log of mail volume handled per employee in a store, 
as an alternative dependent variable. 

For H4, which examines the association between the degree of common uncertainty 
and employees’ perceived unfairness, we use proxies for perceived unfairness as the de- 
pendent variable. These proxies are described in the following section. 


Explanatory Variables 
Plan Adoption Indicator Variables 

Hypothesis 1 predicts that holding other factors fixed, introducing the new RPE incen- 
tive plan is positively associated with stores’ financial performance. We test this hypothesis 
using dummy variables FYR_PLAN and SYR_PLAN to identify changes in profitability 
associated with implementation of the new incentive plan, after accounting for changes due 
to the control variables. FYR. PLAN is a dummy variable representing the first year fol- 
lowing new incentive plan implementation, taking the value 1 for 1998, and 0 otherwise. 
SYR_PLAN is a dummy variable representing the second year following new incentive plan 
implementation, taking the value 1 for 1999, and 0 otherwise. 


Degree of Common Uncertainty 

Hypotheses 2 and 3 predict that two contextual factors—degree of common uncertainty 
and perceived unfairness—are likely to moderate the performance impact of the new RPE 
incentive plan. Specifically, H2 predicts that under the new contract, increasing the degree 
of common uncertainty increases financial performance and H3 predicts that the incentive 
plan’s impact on performance is mitigated in stores at which the perceived unfairness is 
likely to be high, relative to other stores. 

We operationalize the degree of common uncertainty (COMMON) using three empirical 
proxies: adjusted R?, normalized residual dispersion, and average Euclidean distance of 
stores in a group from the reference group mean of performance control variables in Equa- 
tion (1). Our first two empirical proxies are obtained by estimating the following regression 
equation for each reference group by year.!^ 


PROFRATE,, = Во + B,SERVICE_AREA,, + B;POP,, + BN. BANK,, 
+ B4N. EMP,, + BDIST,, + Ве DELPOINT,, 
+ B,N_COUNTER,, + ВьМ DELSEC,, + &, (1) 


where: 


PROFRATE = profitability rate for store i, measured as revenue divided by operating 
cost at year г; 
SERVICE_AREA = square kilometers (in hundreds) of the area that a store is responsible 
for at year 5 
POP - the natural log of population (in thousands) of the area that a store is 
responsible for at year t; 


м Unlike in Equations (2) and (3), discussed in Section VL, we could not use variable METRO, which takes on 
the values of 1, 2, 3, and 4 for rural area, small-sized city, middle-sized city, and metropolitan area, respectively, 
in estimating reference-group-specific Equation (1) because, for example, group 1 consists of stores in Seoul 
only. 
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“М.ВАМК = number of financial institutions (in hundreds) in the area that a store 
is responsible for at year 5; 
N..EMP = number of employees of a store at year 5 
DIST = average daily delivery distance (in hundred kilometers) per mail carrier 
of a store at year t; 
N_DELPOINT = average number of daily delivery points (in hundreds) per mail carrier 
of a store at year t; 
N..COUNTER = number of available counters (in hundreds) of a store at year 5 
N_DELSEC = number of delivery districts (in hundreds) of a store at year t; and 
t = 1997-1998. 


Our first proxy, the adjusted R? obtained by running the above reference-group-specific 
regression, captures the extent to which the variation in profitability is explained by the 
variation in independent variables within each reference group. Considering that the inde- 
pendent variables are those used in the cluster analysis to form the nine reference groups, 
our adjusted R? measure should effectively capture the degree of common uncertainty.’ 
Since the magnitude of R? can be noisy,!® we use a ranking on adjusted В? (RSQRANK) 
that ranges from 1 to 9, with higher values indicating greater R?. 

Gu (2002) proposes using regression residual dispersions with proper control for scaling 
as an alternative to the regression R? and as a comparable measure of explanatory power 
in earnings' value-relevance studies. Gu (2002) points out that the residual variances are 
subject to the scale of the dependent variable, with high levels of the dependent variable 
generally associated with high residuals. Accordingly, we control for this scaling effect by 
dividing the estimated residual variance o, by the sample mean of our dependent variable 
PROFRATE in constructing our second proxy. To facilitate interpretation, we use a ranking 
on normalized residual variance (RESIDRANK) that ranges from 1 to 9, with higher values 
indicating smaller residual variance. 

Our third proxy is the average of store-level Euclidean distance from the arithmetic 
mean of performance control variables of stores that belong to a specific reference group." 
A rationale behind this measure is that a reference group whose stores are more closely 
populated along the within-reference-group mean of various controls is likely to exhibit 
more homogeneity among its members. We use a ranking on distance (DISTRANK) that 
ranges from 1 to 9, with higher values indicating smaller distance from the mean. 


15 "This situation is analogous to the RPE literature on executive compensation documenting that the relevant peer 
group should be companies in the same industry rather than the entire stock market, in the sense that there 
should be more correlation in common shocks (i.e., greater common uncertainty) within the same industry 
(Prendergast 1999). 

16 For example, in 1997, the adjusted R? of the nine reference groups ranges from 0.159 to 0.863. For all of our 
proxies for COMMON, we report the results using the rank values of our three proxies for the degree of common 
uncertainty instead of the raw values in Section VI. We also discuss the results using raw values in the robustness 
analysis. 

17 Huclidean distance is the distance between two p-dimensional items and is widely used for cluster analysis (e.g., 
in the k-means method). Specifically, our measure is computed using the within-reference-group (RG) average, 


5 
d(Store, RGmean) = i> (x, — XP, where x, = ЗЕКИСЕ_АКЕА, x, = POP, x, = М_ВАМК, x, = DIST, 
kel 


and x, = N_EMP. We firs: regress PROFRATE on ће eight performance controls included in Equation (1) and 
retain five variables that are statistically significant predictors of PROFRATE and use them to compute Euclidean 
distance. All variables are standardized before calculating Euclidean distance because it is a convention to 
standardize variables before performing cluster analysis and our various performance controls used in Equation 
(1) are measured in different units Johnson and Wichern 2002). 
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Perceived Unfairness 

Another key issue is how to identify stores at which the level of perceived unfairness 
is likely to be high (UNFAIR) under the new incentive plan. Interviews with store managers 
revealed that the key negative response arising from the new plan was “abandonment.” 
That is, employees who feel that their store reference group is too “strong” for them are 
not well motivated to exert effort to exceed the reference group's benchmark performance 
standard. In our analysis, we use two empirical proxies for UNFAIR. First, we use indicator 
variable CHANGE, which equals 1 for the 13 stores that filed a formal request to head- 
quarters to change reference group in mid-2000.!? These stores were allowed to file a formal 
complaint to postal headquarters only after a regional postmaster carefully investigated the 
store's reported problems. We infer that they are likely to have suffered from high levels 
of perceived unfairness due to feelings such as abandonment.or skepticism of the new plan. 

We also create an ordinal variable, UNSATIS, to measure potential perceived unfairness. 
An internal survey óf the new incentive plan's impact revealed that employees became the 
most frustrated when they did not receive a bonus even though they improved performance 
relative to the previous year. This survey result is consistent with the prediction of referent 
cognitions theory that “Resentment is maximized when people believe they would have 
obtained better outcomes if the decision maker had used other procedures that should have 
been implemented" (Cropanzano and Folger 1989). Accordingly, we compared each store's 
rank on its profitability divided by the mean of its reference group (the profitability measure 
under the new plan) with the store's rank on profitability growth, and define UNSATIS as 
the decile ranking based on the difference, with higher values indicating higher perceived 
unfairness.? In sum, we expect а store to exhibit high perceived unfairness if that store’s 
profitability relative to its peers is low even though it achieves high profitability growth. 


Control Variables 

POST's universal adoption of a new incentive plan across all the stores precludes us 
from using stores that did not change the incentive plan as a natural control sample. We 
therefore include a comprehensive set of proxy variables that likely control for differences 
in the business environments of stores in the estimation model, in order to isolate the 
incentive effect attributable to the new plan (Dopuch and Gupta 1997).” We first include 
the variables that were actually used in the cluster analysis to form POST's nine reference 
groups as performance controls, because POST believes those variables are highly corre- 
lated with a store's profitability. Variables such as POP, SERVICE. AREA, and METRO 
(which takes on the values of 1, 2, 3, and 4 for rural area, small-sized city, m:ddle-sized 
city, and metropolitan area, respectively), are also widely. used as controls in the literature 
on analyzing the productivity in global postal services (e.g., Maruyama and Nakajima 2002, 


18 As described in Figure 1, POST adjusted the reference group for each store by performing a second cluster 
analysis in November 1999. Therefore, the stores whose reference group was changed in November 1999 could 
be considered those at which perceived unfairness is likely to be high. However, since new variables were added 
in the second cluster analysis, the change of reference group could be attributable to the addition of these new 
variables in the analysis. Therefore, we chose to use the 13 stores that filed a formal request to postal headquarters 
to change reference group in 2000 as stores with high “perceived unfairness.” 

1? We do not have complete nonfinancial and cash flow target data to construct the new plan's total score. The 
two most highly weighted measures, relative profitability and relative productivity, are highly correlated, and so 
are their ranks (the correlation is 0.90, based on 1997—1999 data). Moreover, relative profitability is much more 
controllable than relative productivity. We therefore use the rank of relative profitability alone in defining 
UNSATIS. 

20 [n estimating benchmark performance standards for public school expenditures, Dopuch and Gupta (1997) used 
proxy variables to reflect differences in socioeconomic factors that are used by the State of Missouri in setting 
a funding formula. 
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2003). However, among these variables, we do not include the number of employees of a 
store (М. ЕМР) as a control, in order to reflect store-level efforts to reduce the number of 
employees in response to the new incentive plan. 

We additionally control for other factors that may influence store performance. Our 
METRO variable, which captures the economic viability of the area in which a store op- 
erates, also controls for the level of competition. This is because major competitors of 
POST, such as domestic and international couriers, operate mainly in profitable urban areas. 
We control for average labor cost per employee of a store (SKILL), in million Korean won, 
to capture the difference in the ability of the work force given that.pay level increases with 
job tenure in POST. Finally, we control for the effect of product mix on store profitability 
with PROD..MIX, which is measured by the relative proportion of more profitable mail 
services (e.g., parcel, registered mail, and express mail) to total mail volume received by a 
store. If data on postmaster tenure were available, it could be included as an additional 
explanatory variable as a proxy for managerial expertise. Postmaster tenure data are not 
available, but because POST strictly enforces mandatory job rotation policy about every 
three’ years, we do not expect postmaster tenure to be highly correlated with store 
profitability. 

Although we include many factors that may influence store-level performance, one 
should use caution in claiming that these control variables include all exogenous factors 
that affect a store's profitability. In addition, some control variables may suffer from mea- 
surement error. For example, М BANK is included to control for the effect of average 
income on profitability. More appropriate proxies for average per capita income, such as 
income tax paid or average price of houses, were not available. Our concern about not 
including a more direct proxy for per capita income is alleviated by the fact that the whole 
nation's average per capita income in 1998 relative to 1997 decreased due to the macro- 
economic impact of the nation's financial crisis in late 1997.21 If income is a significant 
determinant of a store's profitability, then that should bias against our results. Furthermore, 
the rapid substitution of electronic communication such as email and mobile phone service 
for traditional surface mail in the late 1990s should also bias against finding our results. 


Descriptive Statistics 


Descriptive statistics of the variables used in the analyses appear in Panel A of Table 


2.22 For comparison, we provide descriptive statistics for performance under the old contract 


and the new RPE contract by year. Overall, the average financial performance of stores 
improved in the two years following introduction of the new plan. On average, operating 
profit increased from 60 million under the old plan to 556 million in the first year and to 
916 million in the second year (1997-constant Korean won), due to increased revenues and 
reduced operating costs. One interesting finding is that the average operating cost of stores 
decreased substantially (by 11.1 percent) in 1998, the first year after introduction of the 


21 Since the Korean economic crisis in 1997 heavily affected the nation's economy, including mail volume, one 
might argue that improved performance in 1998 may be simply attributable to mail volume increases due to 
overall economic recovery from the crisis rather than incentive plan change. However, the negative economic 
impact of the crisis was the most conspicuous in 1998, not in 1997, because the crisis occurred in late November 
1997. For example, real GDP growth rate in the Korean economy in 1998 was —6.7 percent compared to 6.8 
percent in 1996 and 5.0 pezcent in 1997 (Crotty and Lee 2001). This suggests that any improvement in POST's 
store performance in 1998 is not likely to be the artifact of overall economic recovery from the crisis because 
1998, the first year after the incentive plan change, was the year that was affected the most severely by the 
crisis. For a detailed chronological description of major events during the crisis, see Baek et al. (2004). 

2 Due to store-level missing data in some of our control variables, the final number of observations used in 
estimating our regression models is 619. 
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new plan, consistent with our discussions with management that stores initially responded 
to the new incentive plan by reducing operating cost rather than by increasing revenue. 
However, this sharp decrease in operating cost relative to the previous year almost disap- 
pears in the second year (1999) and continued profit growth in the second year appears to 
be achieved through increased revenue rather than reduced cost. 

The large standard deviation in operating profit implies that stores vary significantly in 
terms of profitability. The profitability rate.(revenue divided by operating cost) increased 
from 0.84 during the under the old plan to 0.98 in the first year and to 1.07 in the second 
year following introduction of the new plan. An average profitability measure that is smaller 
than 1 in 1997 suggests that on average, stores were suffering losses. The results of the 
paired t-test, not tabulated, show that most of the control variables are not statistically 
different under the old and new plans. This provides preliminary evidence that the change 
in profitability, if any, could be attributable to the implementation of the new RPE plan. 

Panels B and C of Table 2 present store profitability (PROFRATE) and change in 
profitability (APROFRATE) by classifying the stores based on the degree of common un- 
certainty and perceived unfairness using the data after introduction of the new plan. 

As seen in Panel В of Table 2, the high degree of common uncertainty (DCU) stores 
exhibit a statistically higher level of PROFRATE and APROFRATE than the low DCU stores 
in the years following the new plan. Similarly, Panel C shows that the high perceived 
unfairness (PU) stores exhibit statistically lower levels of PROFRATE and APROFRATE. 
These results are robust to the alternative measures of the degree of common uncertainty 
and perceived unfairness. Consistent with our H2 and H3, these findings provide prelimi- 
nary evidence on the role of the degree of common uncertainty and perceived unfairness 
as moderating factors of performance improvement in response to the new incentive plan. 
These results, however, should be interpreted with caution because the analysis in Table 2 
does not control for store attributes that potentially influence profitability. 


Univariate Correlations 

Table 3 provides details on Pearson correlations among the variables used in the anal- 
yses. Square kilometers of the area (SERVICE. AREA), average number of daily delivery 
points per mail carrier (N..DELPOINT), and average daily distance per mail carrier (DIST) 
are negatively and significantly correlated with profitability. Other independent variables 
such as number of financial institutions (V_BANK) and population (POP) are all positively 
and significantly correlated with a store's profitability. 

As expected, correlations among alternative definitions of the degree of common un- 
certainty (RSQRANK,_,, RESIDRANK,_,, DISTRANK, |) are significantly positive and store 
profitability (PROFRATE) is positively correlated with all three variables. PROFRATE is 
also significantly negatively correlated with alternative definitions of perceived unfairness 
(CHANGE and UNSATIS,_,). The correlation between CHANGE and UNSATIS, is positive 
but not significant.? UNSATTS, , is significantly negatively correlated with all three proxies 
of the degree of common uncertainty, providing preliminary support for our H4. The cor- 
relation between CHANGE and our three common uncertainty variables is negative but not 


significant. ` 


23 Since we provide correlations among variables used in our Equations (2) and (3) in Table 3, we were unable to 
include correlations among measures of common uncertainty and perceived unfairness during the 1998-1999 
period that is actually used for testing H4. In this time period, the correlation between CHANGE and UNSATIS 
is positive and significant (0.14, p-value — 0.004). UNSATIS is significantly negatively correlated with all three 
proxies of the degree of common uncertainty. The correlation between CHANGE and the three common uncer- 
tainty variables is negative, but it is significant only with DISTRANK. 
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VL EMPIRICAL MODELS AND RESULTS 
Performance Impact of New Incentives 
To test H1, we estimate the following fixed-effects regression equation using pooled 
cross-sectional time-series data for the 214 stores: 


PROFRATE,, = o, + o,FYR РГАМ,, + «,SYR_PLAN,, + a,SERVICE. AREA,, 
+ o4POP,, + аз BANK,, + agMETRO,, + o;DIST,, 
+ agN_DELPOINT,, + аз COUNTER,, + e, N. .DELSEC,, 
+ а SKILL, + 0,PROD_MIX,, + є, (2) 


where t = 1997~1999. 

To control for any store-specific omitted variables, we estimate Equation (2) using 
fixed-effects regression. The coefficient оп FYR. PLAN captures the average difference in 
profitability measures between the first year after the new incentive plan (year 1998) and 
base year 1997. Similarly, the coefficient on SYR_PLAN captures the average difference in 
profitability measures between the second year after the new incentive plan (year 1999) 
and base year 1997. Significant positive coefficients оп РУК РІАМ and SYR..PLAN after 
controlling for other exogenous factors are interpreted as their average impact on financial 
performance, thus providing support for НІ. 


Moderating Effects of Degree of Common Uncertainty and Perceived Unfairness 


To jointly test H2 and H3, we estimate the following fixed-effects regression equations 
using store-level data for 1998—1999: 


PROFRATE,, = 8) + 8; COMMON, + 8,UNFAIR,, + 8,SERVICE_AREA,, 
+ 8,POP,, + 8,N_BANK,, + 8,МЕТКО,, + 8,DIST,, 
+ 8,N_DELPOINT,, + б COUNTER,, + бур М DELSEC,, 
+ 8,,SKILL,, + 8;,PROD_MIX,, + ер (3) 


where г = 1998—1999. 

We use RSQRANK, RESIDRANK, and DISTRANK as alternatives for COMMON, and 
we use CHANGE and UNSATIS as alternatives for UNFAIR. We expect the coefficient on 
COMMON to be significantly positive, which suggests that with the new RPE contract, a 
store’s degree of common uncertainty within its reference group is positively associated 
with its financial performance. Since the profitability of a store relative to its reference 
group receives the largest weight in determining bonuses under the RPE-based plan, we 
expect the coefficient on UNFAIR to be significantly negative. This would suggest that the 
new incentive plan’s performance impact is mitigated in stores that suffered from a high 
level of perceived unfairness, indicating that the new RPE-based plan’s incentive effect is 
conditional on the employees’ perception of fairness of the benchmarked target.?* 


?* Recall that H4 predicts a negative relationship between the degree of common uncertainty and perceived un- 
fairness, both of which are included as key explanatory variables in Equation (3). It follows that common 
uncertainty may have both direct and indirect effects on store profitability. For example, a high degree of common 
uncertainty may positively affect store profitability directly, and also positively affect store profitability indirectly 
through lowered perceived unfaimess. Our multiple linear regression model does not allow us to reliably dis- 
entangle the direct effect from the indirect effect of the degree of common uncertainty on performance. Ac- 
cordingly, the effect of the degree of common uncertainty on performance as captured by the coefficient on 
COMMON may be understated because we explicitly control for perceived unfairness in the regression. 
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We use COMMON,_, and UNSATIS, , in estimating Equation (3) under the assumption 
that the employees will exert effort based on their perception of the benchmarked perform- 
ance standards in the previous year. Since actual performance evaluation of stores for year 
t is performed in March of year t+1, stores do not know the benchmarked performance 
standards against which their performance will be compared during year t. Thus, we assume 
that the benchmarked performance standard used in the year 1—1 evaluation serves as a 
reasonable estimate of a performance standard for year 7 evaluation. 


Association between Perceived Unfairness of Benchmarked Target and Degree of 
Common Uncertainty 


To test the negative association predicted in H4, we estimate two regression equations 
using store-level data for 1998—1999. Unlike in Equations (2) and (3), in which store 
performance (PROFRATE) is our dependent variable, our dependent variables here are our 
two proxies for store-level perceived unfairness, namely, CHANGE and UNSATS. We first 
estimate Equation (4), a binary logit regression examining the association between 
CHANGE and COMMON. We then estimate Equation (5), an ordered logit regression ex- 
amining the association between UNSATIS and COMMON. Hypothesis 4 predicts that the 
coefficient on COMMON will be significantly negative, suggesting that agents are more 
(less) likely to suffer from perceived unfairness under the new RPE incentive contract if 
the degree of common uncertainty is low (high) within their reference group.?* 

Prob(CHANGE = 1) = F(a) + a,COMMON,, + a,SERVICE_AREA,, 
+ о;РОР,, + a,N_BANK,, + а,МЕТКО,, 
+ agN_EMP,, + a,DIST,, + а DELPOINT., 


+ о, М_СООМТЕК,, + o,9N. DELSEC;, 
7 
+ 2, о, О ЕСТОМ) | (4) 


Prob(UNSATIS) = F(8, + 8;COMMON,, + 8,SERVICE_AREA,, + 8,FOP., 
+ 8,N. BANK,, + 5,METRO,, + 8.N_EMP,, 
+ 8,DIST,, + 8sN_DELPOINT,, + &,N. COUNTER,, 


7 
+ 5 М. DELSEC,, + У, S, о КЕСТОМ) 
iei 
(5) 
where t = 1998—1999. 


One potential issue that arises is controlling for factors related to store-level variation 
in the budgeting process that may influence the perceived fairness of the bucget target. 





25 We estimate logistic regressions of Equations (4) and (5) by including seven region dummies to control for 
unspecified geographical region-specific differences in stores’ profitability rather than store-specific fixed effects 
because in a fixed-effects logit model, the ability to estimate parameters is corrupted by an шарш-у to estimate 
consistently the subject-specific effects (Frees 2004). The seven regions, which roughly correspond to the prov- 
inces, are defined by the area covered by each of POST's regional headquarters. The REGION variables cor- 
respond to the eight regions, minus one that is the base group (the province that includes Seoul, the capital and 
largest city in Korea). These dummies are included to control for the effect of macroeconomic factors that are 
peculiar to the region in which a store is located. 
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However, because all the stores were required to adopt the same new plan, we do not have 
such store-level budgeting factors. We do attempt to control for potential effects of the 
different business environments facing each store—although exploratory, we include the 
nine exogenous variables used by POST in forming reference groups. 


Impact of Incentive Plan Change on Financial Performance 


Column (1) of Table 4 presents the estimation results of Equation (2) examining the 
new incentive plan’s impact on profitability (that is, the profitability rate). The coefficients 
on FYR_PLAN and SYR_PLAN, after controlling for other exogenous factors, are positive 
and significant (p < 0.0001 for both, two-tailed), indicating that the incentive plan change 
has a positive and significant effect on financial performance. Thus, our results support 
Н1.26 To assess the economic significance of the result, the coefficients оп РУК PLAN and 
SYR РГАМ suggest that the introduction of the new incentive plan is associated with the 
increase in average store profitability by 0.137 in the first year (1998) and by 0.221 in 
the second year (1999) compared to the base vear (1997) after controlling for other factors 
that may influence store profitability. 

Column (2) of Table 4 shows that the results are very similar when mail volume handled 
per employee (MVOL) is used as an alternative dependent variable. This suggests that 
improved productivity caused by the new incentive plan plays a role in driving profitabil- 
ity." As discussed in Section V, interviews with management and store managers revealed 
that the initial profitability improvement in response to the new incentive plan is mainly 
attributable to reducing labor and overhead cost. Given that mail volume is largely deter- 
mined by exogenous factors, a positive and significant incentive effect on productivity 
appears to be driven by stores' efforts to eliminate unnecessary employee hours. 

To provide insight into whether the new plan's impact on performance persisted and 
grew over time (і. е. continuing improvement), we test equality of the coefficients on 
FYR_PLAN and SYR. PLAN. Based on the result of the F-test, we reject the null hypothesis 
that the two coefficients are equal at the 1 percent level, suggesting that store employees 
exerted more effort to learn how to perform better and were motivated to further improve 
profitability in the second year following implementation of the new ріап.28 


Effect of Degree of Common Uncertainty and Perceived Unfairness 
on Financial Performance 


Table 5 provides the results from estimating Equation (3) with the six combinations 
arising from crossing three measures of COMMON with two measures of UNFAIR. The 


?5 We compute variance inflation factors (VIF) and the condition index to investigate potential inference problems 
due to collinearity. Most of the variance inflation factors for the variables are less than 2 and the largest variance 
inflation factor is 5.1, which is smaller than the threshold of 10 in the econometrics literature (Kennedy 1992). 
The largest condition index number is 33, which is slightly larger than 30, the cut-off proposed by Belsley et 
al. (1980). We investigate the proportion of variation contributed by each variable and find that SKILL is 
responsible for the condition index being over 30. However, our findings are robust to excluding SKILL from 
the model. 

27 Since the use of mail volume handled per employee (MVOL) as an alternative dependent variable produces very 
similar results, the subsequent analyses use only PROFRATE as the dependent variable. 

25 When MVOL is used as the dependent variable, the coefficient on SYR. PLAN is still statistically larger than 
that on FYR..PLAN, but the difference between the two becomes smaller than when we use PROFRATE as our 
dependent variable. This suggests that the new plan's impact on continuing improvement of profitability in the 
second year was greater than its impact on productivity because, in comparison to the productivity measure over 
which stores could not exert much control, the profitability rate provides us with a more comprehensive per- 
formance measure in the sense that it reflects stores’ efforts to trim overhead costs and to increase revenue 
sources other than those generated by mail volume. 
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coefficient on COMMON is positive and significant at the 5 percent level (two-tailed) except 
in column (1), where the coefficient is significant at the 10 percent level, and column (5), 
where the coefficient is not significant. These results suggest that holding other factors 
fixed, a greater degree of common uncertainty leads to higher store performance under the 


TABLE 4 


Regressions Examining the Impact of Incentive Plan Change on Performance 
(t = 1997-1999) 


PERF,, = ар  o4FYR..PLAN,, + a,SYR_PLAN,, + o,SERVICE_AREA,, + ацРОР,, 
+ а: М_ВАМК,, + agMETRO,, + «,DIST,, + сі - РЕГРОІМТ,, 
+ agN_COUNTER,, + аз М DELSEC,, + as, SKILL, + a;PROD. MIX,, 


t Е, 
(1) (2) 
РЕКЕ 
PROFRATE MYOL 
Variable ____ Predicted Sign Coefficient (t-statistic) Coefficient (t-statistic) 
Constant ? 2281* 1.958 
(1.84) (1.38) 
FYR_PLAN (+) 0.137*** 0.244*** 
(6.17) (9.64) 
SYR_PLAN (+) 0221*** 0.304*** 
(10.07) (12.12) 
SERVICE_AREA T —0.017*** —0.021*** 
(—5.24) (—5.55) 
POP ? 0.093 *** 0.258*** 
(5.15) (12.53) 
N..BANK ? 0.239*** 0.249*** 
(9.90) (9.01) 
METRO ? 0.194*** 0.212*** 
(9.95) ` (9.£5) 
DIST ? —0.058** —0.(91*** 
(—2.39) (-3.28) 
N..DELPOINT ? 0.0002 0.001 
(0.18) (0.81) 
N..COUNTER ? 0.020 0.C48* 
(0.84) (1.77) 
N..DELSEC ? —0.026 —0.497*** 
(–0.47) (—7.54) 
SKILL ? —0.179** 0.433*** 
(-2.41) (5.69) 
PROD_MIX ? 0.215. -0.019 
(1.51) (-0.12) 
n 619 619 
Adjusted R? 80.2% 83.8% 
F-test of Hy: a, = a, ` p < 0.0001 р = 0.004 
Change іп В? by 
FYR_PLAN and SYR_PLAN 3.4% ++ 43 *** 


(continued on next page) 
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*, эж, **** Significant at the 10 percent, 5 percent, and 1 percent levels, respectively (two-tailed). 
Reported t-statistics are based on the White (1980) standard errors. 
Variable PERF represents the rate of profitability (PROFRATE) or mail volume handled per employee (MVOL). 
Variable Definitions: 
FYR_PLAN = dummy variable representing the first year following RPE incentive plan implementation, 
taking the value 1 for 1998, and 0 otherwise; 
SYR_PLAN = dummy variable representing the second year following RPE incentive plan implementation, 
taking the value 1 for 1999, and 0 otherwise; 
SERVICE АКЕА = square kilometers of the area that a store is responsible for at year t (in hundreds); 
POP = the natural log of population (in thousands) of the area that a store is responsible for at 


year t; 
N..BANK = number of financial institutions in the area that a store is responsible for at year г (in 
hundreds); 
METRO = variable taking on the values of 1, 2, 3, and 4 for rural area, small-sized city, middle-sized 
city, and metropolitan area, respectively; 

DIST = average daily delivery distance per mail carrier of a store at year t (in hundred kilometers); 
N_DELPOINT = average number of daily delivery points per mail carrier of a store at year ¢ (in hundreds); 
N..COUNTER = number of available counters of a store at year t (in hundreds); 

N_DELSEC = number of delivery districts of a store at year t (in hundreds); 
SKILL = average labor cost. per employee of a store (million Korean won); and 
PROD. MIX = relative proportion of parcel, registered mail, and express mail to total mail volume received 
by a store. 


new plan. This finding suggests that it is critical to explicitly take into account the major 
differences in environments of business units when forming a reference group. The use of 
peer group performance to gain information about an individual employee's effort and to 
insulate employees' compensation from risks they cannot control is more valuable in con- 
tracting with a higher degree of common uncertainty. 

On the other hand, the coefficients on the proxies for UNFAIR allow us to test the 
predicted relation between perceived unfairness and store performance. For all six regres- 
sions, the coefficients on UNFAIR are negative and significant. When UNSATIS, , proxies 
for UNFAIR, the coefficient is negative and significant (p « 0.0001, two-tailed) as seen in 
columns (2), (4), and (6), providing strong evidence on our prediction that the new RPE 
plan's incentive effect will be mitigated in stores with likely high perceived unfairness. 
When CHANGE proxies for UNFAIR, the coefficient is negative and marginally significant 
(p = 0.06 to 0.07, two-tailed) as seen in columns (1), (3), and (5), providing weak evidence 
that the new plan's incentive effect was attenuated in stores with high perceived unfairness. 

To examine the economic significance of the results, we assess the marginal effect of 
COMMON and UNFAIR on store profitability. For example, column (1) suggests that stores 
that requested to change their réference group in 2000 (CHANGE) exhibit lower profitability 
than other stores by 0.088, holding all else constant. Similarly, column (2) suggests that a 
one-unit increase in decile ranking of UNSATIS, | is associated with a 0.038 decrease in 
store profitability, holding all else constant. On the other hand, a change in membership to 
a reference group with a higher degree of common uncertainty (COMMON,_,) is associated 
with approximately a 0.01 increase in store profitability. Taken together, these results sug- 
gest that the new RPE contract’s incentive effect is mitigated by perceived unfairness and 
that the results are robust to alternative ways of identifying stores with a high level of 
perceived unfairness. 
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Association between Perceived Unfairness and Degree of Common Uncertainty 


Panel A of Table 6 provides the results from estimating the binary logit regression that 
examines whether the degree of common uncertainty is negatively associated with the 
likelihood of predicting stores with high perceived unfairness when CHANGE is used as a 
dependent variable. Consistent with H4, when COMMON is measured with adjusted R?, 
the coefficient on COMMON is negative and significant (p = 0.002, two-tailed). This sug- 
gests that stores are more (less) likely to suffer from employees’ perceived unfairness under 
the new RPE incentive contract if the degree of common uncertainty is low (high) within 
their reference group. The result also implies that the perceived unfairness reported in 13 
stores after the new contract could in part be attributable to low common uncertainty in 
their reference group. The coefficient on COMMON is also negative and significant (p 
= 0.045, two-tailed) when residual dispersion is employed as a proxy for COMMON. 
The coefficient on COMMON is not statistically different from zero (р = 0.17) when the 
average store-level Euclidean distance within reference group is employed as a proxy for 
COMMON. 

For additional analysis, we estimate Equation (5), in which UNSATIS is regressed on 
COMMON using an ordered logit model. Panel B of Table 6 shows that the coefficient 
on COMMON is negative and significant (р = 0.018, two-tailed) when COMMON is mea- 
sured with residual dispersion. The coefficient is also negative and significant (р = 0.034, 
two-tailed) when COMMON is measured with the average store-level Euclidean distance 
within reference group, but is not significant with adjusted R? as a metric for COMMON. 

Even though the results from estimating logit regressions make it difficult to draw 
unambiguous inferences on the association between the degree of common uncertainty and 
perceived unfairness, we believe that taken together, the results of Tables 5 and 6 suggest 
that the net benefits of introducing the new RPE contract are conditional on the degree of 
common uncertainty. This is because the degree of common uncertainty appears to be 
inversely related to likely perceived unfairness related to introducing the contract. 


Robustness Analyses 

We assess the sensitivity of the primary results in a number of additional ways. Instead 
of tabulating the results of the additional tests, we discuss them briefly in this subsection. 

First, we include the residuals obtained by annual regression of mail volume handled 
per employee of each store on major exogenous factors—pcpulation, service area, average 
delivery distance, number of financial institutions—and location of store (the METRO vari- 
able) as an additional explanatory variable, and re-estimate Equations (2) and (3). By doing 
so, we additionally control for unknown omitted factors that may influence store revenue 
through change in mail volume. The coefficient on FYR. PLAN and SYR. PLAN remains 
highly significant (t-statistic — 7.49 and 13.72, respectively, two-tailed) and all the proxies 
of common uncertainty and perceived unfairness continue to exhibit expected signs except 
that the coefficient on RESIDRANK becomes insignificant. 

Second, instead of PROFRATE, we use operating profit scaled by square feet of a store 
as an alternative dependent variable and repeat our analysis on the impact of change in 
incentive plan. The coefficient on КУК PLAN and SYR. PLAN remains highly significant 
(t-statistic = 5.10 and 7.92, respectively, two-tailed). 

Third, following prior literature (Ittner and Larcker 1998b; Banker et al. 2000), we 
include prior performance (PROFRATE,_,) as a control instead of including various per- 
formance controls described in Section V. The coefficient on FYR. PLAN and SYR. PLAN 
remains highly significant (t-statistic = 8.60 and 7.26, respectively, two-tailed) and our 
inferences remain unchanged. 
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Fourth, we re-estimate Equation (3) using year 1999 data only instead of 1998 and 
1999 data in order to allow for the possibility that it may take considerable time before the 
perceptions of common uncertainty and unfairness form and influence performance. Our 
inferences regarding H2 and H3 remain qualitatively unchanged except that the coefficients 
on CHANGE become insignificant. 

Fifth, we estimate Equations (2) and (3) using standard ordinary least square (OLS) 
after including the seven REGION dummy variables to control for unspecified geographical 
region-specific differences in stores’ profitability. The estimation results are very similar to 
those obtained by fixed-effects regressions. We also estimate Equations (3), (4), and (5) 
using the raw values of our proxies for COMMON in place of rank values and our inferences 
remain qualitatively unchanged, suggesting that using the rank value does not confound 
our findings. 

Sixth, we estimate Equation (3) after we include a vear 1999 dummy variable to control 
for a potential “learning effect" in the second year following the new plan. Our inferences 
are not changed. 

Finally, we estimate Equations (3) and (4) using the following two alternative ways of 
measuring the level of perceived unfairness (UNFAIR). As a variation of UNSATIS, we 
employ a dummy variable representing stores that exhibited top 50 percent profitability 
growth, but bottom 50 percent relative profitability. As a variation of CHANGE, we use a 
dummy variable for the 37 stores that in November 1999 had their reference group changed 
to a “lower” reference group relative to their first reference group in December 1997. Our 
inferences remain unchanged. 


ҮП. CONCLUDING REMARKS 

Using annual performance evaluation data of 214 stores of a postal service provider in 
Korea during 1997 through 1999, we find that adding relative performance measures to an 
incentive plan is positively associated with financial performance at the store level and that 
. under the new incentive plan, the degree of common uncertainty is positively associated 
with store profitability. Furthermore, we find evidence that the new plan's incentive effect 
is mitigated in stores at which the employees' perceived unfairness is likely to be high. 
Finally, we find that the net benefits of introducing the new contract may be conditional 
on the degree of common uncertainty because the degree of common uncertainty appears 
to be inversely related to perceived unfairness related to the new contract, with its intro- 
duction of relative performance measures. 

Our results are especially striking given that the postal service provider did not switch 
from a pure non-RPE incentive plan to an RPE plan. Rather, both plans had an element of 
RPE through a tournament structure in which only employees in stores that scored in the 
top 50 percent on the designated performance measure received a bonus. Under the new 
incentive plan, the postal service classified all stores into nine reference groups, based on 
business environments. The new plan also introduced a performance measure that placed 
comparatively large weights on profitability (i.e., store revenue divided by store operating 
cost) relative to average reference-group profitability, and on productivity (i.e., mail volume 
per store employee) relative to average reference-group productivity. 

Our results have several implications for implementation of RPE incentive plans. First, 
even though an RPE incentive plan appears to induce agents to exert more effort by reducing 
risk and introducing competition, a significant level of dysfunctional responses can occur, 
for example, due to decreased morale or skepticism brought on by employees' perceived 
unfairness of benchmarked targets. Second, the observed perceived unfairness caused by 
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introducing an RPE contract may mitigate the incentive effect. Third, our study suggests 
that the level of perceived unfairness increases as the degree of common uncertainty de- 
creases. This suggests that careful attention should be directed to setting the benchmark 
standards against which the performance of business units is compared, and it is critical to 
explicitly take into account the major differences in environments of business units when 
forming a reference group. This supports conventional wisdom that benchmark units should 
be from similar environments (Dopuch and Gupta 1997; Brickley et al. 2003). 

This study’s findings are subject to several important caveats. First, this study uses 
observations from a single firm and it is an open question whether we can generalize results 
from a single research site. Second, although we perform a number of robustness tests, the 
absence of a natural control sample and long time-series data make it difficult to completely 
rule out all alternative explanations for our findings. Third, we are unable to measure ex 
ante perceived unfairness, i.e., employees’ perception of the incentive plan as unfair before 
the bonuses were granted. Instead, we develop ex post proxies that attempt to measure 
perceived unfairness after the bonuses were granted, but our measure may in part also 
capture dissatisfaction with not receiving a bonus. Future research could attempt to replicate 
our findings in a setting that controls for our limitations. 
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L INTRODUCTION 
e provide evidence on how investor valuation of derivative financial instruments 
differs depending upon whether the fair value of these instruments is recognized 
or disclosed.! We find that in a period prior to SFAS No. 133 (FASB 1998), 
the valuation coefficients on recognized derivatives are statistically significant, whereas the 
valuation coefficients on disclosed derivatives are not significant. Furthermore, for a sample 
of banks that held only disclosed derivatives before SFAS No. 133, we find that the valu- 
ation coefficients on disclosed derivatives are not significant before SFAS No. 133 (when 
they were disclosed), but are significant after SFAS No. 133 (when they were recognized). 

The research question we address is important for a number of reasons. First, from a 
standard-setting perspective, recognition and disclosure are viewed as alternative accounting 
treatments. Therefore, whether recognition versus disclosure affects users’ decisions has 
been an issue of considerable interest to standard setters, practicing professionals, and 
academic researchers. In efficient markets with minimal information-processing costs, one 
would not expect the accounting treatment (i.e., recognition versus disclosure) to matter. 
However, if there are (1) costs of processing information (Barth et al. 2003), (2) systematic 
biases in how investors process information, such as limited attention (Hirshleifer and Teoh 
2003), or (3) differences in perceived reliability of recognized versus disclosed items, this 
choice can matter to users. 

Second, although accounting for derivatives has been one of the most widely debated 
issues among academics and practitioners, to our knowledge, no prior study examines 
explicitly whether investor valuation of derivatives depends upon recognition versus disclo- 
sure in the pre-SFAS No. 133 period and no study has examined whether SFAS No. 133 
has affected investor valuation of derivatives. 

We perform two complementary tests of differences in investor valuation of recognized 
versus disclosed derivative instruments on two samples of bank holding companies. The 
first test examines whether recognized derivative instruments are valued differently from 
disclosed derivative instruments for a sample of banks that simultaneously hold both rec- 
ognized and disclosed derivative financial instruments for risk management purposes in a 
pre-SFAS No. 133 period. We find that valuation coefficients are significantly greater than 
zero for recognized derivative instruments but not for disclosed derivative instruments. 

The second test examines whether there is a difference between valuation of recognized 
derivatives in the post-SFAS No.133 period and valuation of disclosed derivatives in the 
pre-SFAS No. 133 period. We find that the valuation coefficients on the recognized deriv- 
atives (in the post-SFAS Мо.133 period) are significantly positive, whereas the valuation 
coefficients on the disclosed derivatives (in the pre-SFAS No. 133 period) are not signifi- 
cantly different from zero. 

We contribute to two streams of prior research. First, a number of prior studies inves- 
tigate implications of recognition versus disclosure of other accounting items (e.g., stock 


ra 


Banks hold derivative financial instruments to meet customer demands, to trade these products for their own 
accounts (for trading purposes), and to manage risk (for nontrading purposes). The focus of this paper is on 
derivative financial instruments held for risk management purposes. We uss “risk management derivative finan- 
cial instruments" and ''derivative financial instruments” interchangeably in the rest of the paper. All trading 
derivatives are already recognized at fair value with any realized or unrealized gains or losses included in current 
earnings. See the Appendix for examples of how banks in our sample define their trading and nontrading 
derivative activities in footnotes to their financial statements. 
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options and post-retirement benefits). Typically, these studies use a setting where some 
firms disclose while others recognize (e.g., Aboody 1996), or where a new standard requires 
recognition of a previously disclosed item (e.g., Davis-Friday et al. 1999). The studies in 
the first group are subject to self-selection bias because firms self-select into recognizers 
or disclosers (Bernard and Schipper 1994). One potential problem with the studies in the 
second group is that the amount and/or nature of the information reported about the item 
is rarely held constant because the same firm could be disclosing and recognizing at dif- 
ferent points in time (Bernard and Schipper 1994). 

Our pre-SFAS No. 133 tests employ a within-firm design that mitigates these typical 
problems in recognition versus disclosure studies because banks could report both recog- 
nized and disclosed risk management derivatives at the same point in time. Our pre-SFAS 
133 versus post-SFAS No. 133 analysis, like other studies that examine effects of changes 
in accounting standards, is potentially subject to the second problem. However, this problem 
is likely to be considerably less serious in our setting because SFAS No. 133 explicitly 
requires derivative fair values to be measured using the same valuation guidance and tech- 
niques as those used in the pre-SFAS No. 133 period. In other words, the amount and/or 
nature of information in the post-SFAS No. 133 period is likely to be very similar to the 
amount and/or nature of information in the pre-SFAS No. 133 period. 

In addition to these general design advantages offered by the setting we use, our paper 
specifically differs from other recognition versus disclosure studies in another important 
respect. Those studies employ a “levels” analysis only, whereas we provide consistent 
evidence using both “changes” and “levels” analyses. Because the “changes” specification 
is less subject to omitted variables bias, our evidence is more robust and allows stronger 
inferences to be made about recognition versus disclosure. Our study also differs from prior 
recognition versus disclosure studies by examining the same research question using two 
different samples with two different research designs, i.e., pre-SFAS No. 133 analysis, and 
pre- versus post-SFAS No. 133 analysis. Second, our study is related to the line of research 
that examines the value-relevance of disclosed information about financial instruments. 
While prior studies on fair value disclosures of derivatives generally find that the disclosures 
are not value-relevant (e.g., Barth et al. 1996; Eccher et al. 1996; Nelson 1996), using 
disclosures required by SFAS. No. 119 (FASB 1994), Venkatachalam (1996) finds that 
banks' derivative fair value disclosures are value-relevant. However, because his study is 
designed to test the valuation implications of derivative fair values rather than recognition 
versus disclosure of these amounts, he aggregates fair values of disclosed and recognized 
derivative financial instruments and restricts their coefficients to be the same. We allow the 
coefficient estimates on recognized and disclosed risk management derivatives to differ and 
document significant differences. Furthermore, we provide evidence on whether SFAS No. 
133 has had an impact on the value-relevance of derivatives. 

While our findings are consistent with recent limited attention or costly information- 
processing models, an alternative explanation for our findings is that there are differences 
in perceived reliability of recognized and disclosed derivatives due to potentially higher 
standards imposed by audit firms and management for recognized items than for disclosed 
items.? 

The rest of this paper is organized as follows: Section II describes the institutional 
background and financial reporting environment related to derivative financial instruments 


2 For example, Libby et al. (2005) find in an experimental setting that “audit partners require greater correction 
of misstatements in recognized amounts than in equivalent disclosed amounts." 
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before and after the adoption of SFAS No. 133. Section Ш presents the research design 
and empirical methods employed in our study. Section IV presents the results of our em- 
pirical analysis. Section V provides a summary and conclusion. 


II. INSTITUTIONAL BACKGROUND 
Accounting for Derivative Financial Instruments Prior to SFAS No. 133 


Prior to SFAS No. 133, accounting for derivatives was governed by (1) SFAS Nos. 52 
and 80 (FASB 1981, 1984), and (2) consensus positions of the Emerging Issues Task Force 
(EITF). Additional accounting guidance for banks was provided by the Audit and Account- 
ing Guide for Banks and Savings Institutions (AICPA 2000)? Accounting for derivative 
instruments that were not specifically covered by SFAS Nos. 52 and 80 developed largely 
by analogy to these standards. The accounting treatment for derivative instruments under 
the requirements of SFAS No. 52 and 80 could be categorized into: (1) hedge accounting, 
(2) synthetic alteration (or settlement accounting), and (3) speculative accounting. 

If a derivative instrument was being used to hedge an existing asset, liability, or antic- 
ipated transaction, then it was accounted for in a manner consistent with the requirements 
of hedge accounting. On the other hand, if a derivative instrument was being used to change 
the nature of one financial instrument to that of another financial instrument (such as an 
interest-rate swap changing floating-rate debt to fixed-rate debt), it was accounted for in 
accordance with the requirements of synthetic alteration (or settlement accounting). 

Under both hedge and settlement accounting, the accounting treatment for a derivative 
instrument was determined by the accounting treatment for the underlying asset or liability 
to which the derivative instrument is designated. Specifically, a derivative instrument was 
recognized at fair value on the balance sheet as an asset or a liability if the related under- 
lying asset or liability was also carried at fair value or lower of cost or fair value (such as 
a foreign-exchange forward hedging the foreign currency risk of an available-for-sale se- 
сигісу). On the other hand, if a derivative instrument was hedging an asset or liability 
carried at historical or amortized cost, it was not recognized at fair value on the balance 
sheet (such as an interest rate swap hedging a variable-rate debt). Depending on the un- 
derlying asset, liability, or transaction, either no entry was made or the derivative instrument 
was carried at historical cost, which often equaled zero. 

If a derivative instrument was used for trading or speculative purposes, it was accounted 
for under speculative accounting, which required the derivative instrument to be recognized 
at fair value. 

To summarize, depending on the accounting for the underlying asset, liability, or trans- 
action, certain derivatives were reported in financial statements at fair value under the 
accounting rules for derivatives and hedging activities prior to SFAS No. 133, and others 





3 Specifically, EITF Issue 84-07 on Termination of Interest Rate Swaps, ЕПЕ Issue 84-36 on Interest Rate Swap 
Transactions, ЕТТЕ Issue 88-08 on Mortgage Swaps, EITF Issue 90-17 on Hedging Foreign Risk with Purchased 
Options Contracts, and EITF Issue 91-04 on Hedging Foreign Currency Risks with Complex Options were the 
EITF issues that provided guidance for accounting for derivative financial instruments (EITF 1984a, 1984b, 
1988, 1990, 1991). 

* Regarding these types of derivative instruments, SFAS No. 80 (in paragraph 5) states: “If an enterprise includes 
unrealized changes in the fair value of a hedged item in income, a change in the market value of the related 
futures contract shall be recognized in income when the change occurs. The same accounting shall be applied 
to a futures contract that hedges an anticipated transaction if the asset to be acquired or liability to be incurred 
will be reported at fair value subsequent to acquisition or incurrence.” 
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were not? This created a financial reporting environment in which it was possible for a 
bank to report some of its risk management derivatives on the balance sheet at fair value, 
and to just disclose the rest of its risk management derivatives, ie., hold recognized and 
disclosed derivative instruments simultaneously. 

Requirements under SFAS No. 107 (FASB 1991) made fair values of all derivative 
financial instruments available to investors, while SFAS No. 119 required banks to clearly 
indicate whether the aggregate fair value of the derivatives portfolio represents a net asset 
or a net liability position and to provide disaggregated information on carrying amounts, 
fair values, and contractual amounts of derivatives classified by category of instrument and 
by purpose for which they are ћејд.5 These expanded disclosures and accounting practices 
prior to SFAS No. 133 provide a natural setting for comparing the valuation implications 
of recognized and disclosed derivative fair value information. 


Accounting for Derivative Financial Instruments after SFAS No. 133 

Statement of Financial Accounting Standards No. 133, Accounting for Derivative In- 
struments and Hedging Activities was issued in June 1998 and was effective for fiscal years 
beginning after June 15, 2000." SFAS No. 133 requires all derivatives, without exception 
and regardless of the accounting treatment for the underlying asset, liability, or transaction, 
to be recognized in the balance sheet as either liabilities or assets. However, accounting for 
the gain or loss on a derivative instrument depends on how it is used. Depending upon 
whether it is a fair value or cash flow hedge, the gain or loss is recognized either in current 
earnings or comprehensive income. While SFAS Nos. 105 (FASB 1990), 107, and 119 aim 
at improving risk-related disclosures for financial instruments, including derivatives, SFAS 
No. 133 constitutes the recognition stage. 


Ш. RESEARCH DESIGN 

Overview 

We use both levels and changes specifications to test the value-relevance of recognized 
and disclosed fair values of derivative instruments. The models assume that the relation 
between derivative value and firm value is cross-sectionally uniform. Furthermore, there 
may be omitted variables if there are unrecognized assets or liabilities, e.g., intangibles, 
which are priced by investors. We use the changes specification to help mitigate potential 
omitted variables bias (Kothari and Zimmerman 1995). 


Pre-SFAS No. 133 Analysis—Levels Model 


- Ош cross-sectional equity framework follows prior research (e.g., Ahmed and Takeda 
1995; Barth et al. 1996; Venkatachalam 1996). The levels specification expresses firm value 





ха 


SFAS No. 119 and SFAS Мо. 133, in addition to SFAS No. 80, acknowledge differential reporting treatment 
for risk management derivatives as well. SFAS No. 119 (in paragraph 48) notes: "ће Board believes that some 
derivative financial instruments may be measured at fair value but not considered held or issued for trading 
purposes. For example, some hedging activities may entail the use of foreign exchange contracts that fail to 
meet the criteria for deferral accounting under FASB Statement No. 52, Foreign Currency Translation. Those 
derivative financial instruments, although measured at fair value, may be categorized as held or issued for 
purposes other than trading" (emphasis in original). SFAS No. 133 (in paragraph 234) also notes: “Under the 
varied accounting practices that existed before the issuance of this Statement, some derivatives were recognized 
in financial statements, others were not." 

The initial year of application for SFAS No. 107 was 1991, and the initial year of application for SFAS No. 
119 was 1994. 

SFAS No. 133 was originelly effective for the fiscal years beginning after June 15, 1999. As a result of receiving 
several complaints from business entities, the FASB delayed the effective date to fiscal years beginning after 
June 15, 2000. 


© 
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as a function of the fair values of recognized and disclosed derivatives, on-balance-sheet 
non-derivative assets and liabilities, and proxies for intangibles. 

In order to mitigate the effects of potential omitted variables, we add two control 
variables to the model. First, consistent with Barth et al. (1996), we include non»erforming 
loans (NPL) as a control variable that contains information on default risk incremental to 
allowance for loan loss reserves. Second, based on prior banking research (Flannery 1982; 
Flannery and James 1984), we include CORE, which is defined as domestic deposits minus 
time deposits, as a proxy for core deposit intangible assets. Inclusion of CORE as a control 
variable is based on the assumption that long-term relationships with depositozs add sig- 
nificant value to banks and this value is not reflected in reported financial statement 
numbers. 

We permit the intercept in Equation (1) to vary by year and bank in order to account 
for period-specific economic effects and firm-specific effects that are not captured by the 
explanatory variables. We use per share data in our cross-sectional levels models and check 
the sensitivity of our results to the use of book value of equity as a deflator. 

The regression model to test whether there are differences in investor valuation of 
recognized and disclosed derivatives in the pre-SFAS No. 133 period is as follows: 


жы, Хог FVHA FVFL 
Y, t 2, Op, ВНС, + o, CSO, fra, C50, 
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where T is the number of years in the sample period; Y, equals 1 if the observation belongs 
to year t, and 0 otherwise; K is the number of bank holding companies in the sample; and 
BHC, equals 1 if the observation belongs to bank holding company i, and 0 otherwise. For 
firm i at time t, MVE, is market value of equity at the end of year t and CSO, is the number 
of common shares outstanding. FVFA, and FVFL, are the aggregated fair values cf financial 
assets and financial liabilities other than fair values of risk management derivativ2 financial 
instruments, respectively. FVFA, is computed as the sum of the fair values of cash and 
equivalents, investment securities, net loans, and other financial assets excluding risk man- 
agement derivative instruments, and FVFL,, is computed as the sum of the fair values of 
deposits, short-term and long-term debt, and other financial liabilities excluding risk man- 
agement derivative instruments. NFA, and NFL, аге the aggregated book values of nonfi- 
nancial assets and nonfinancial liabilities, respectively. RNDP and DNDP are the net fair 
values of recognized and disclosed derivatives, respectively. NPL, is nonperforming loans, 
and CORE,, is domestic deposits minus time deposits. 

| Coefficients of interest in Equation (1) are a, and a. Significant positive values for a, 
and a, indicate the value-relevance of fair values of recognized and disclosed derivative 
instruments, respectively. 


Pre-SFAS No. 133 Analysis—Changes Model 
In order to mitigate potential omitted variables bias, we use the following changes 
specification: 


8 We use book values of nonfinancial assets and nonfinancial liabilities because their fair values are no: observable. 
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We expect В. and Вх in Equation (2) to have significant positive values if recognized 
and disclosed fair values of derivatives are value-relevant to investors. 


Pre- versus Post-SFAS No. 133 Analysis 
In SFAS No. 133, the FASB states (in paragraph 238): 


[SFAS No. 133] increases the visibility, comparability, and understandability of the 
risks associated with derivatives by requiring that all derivatives be reported as assets 
or liabilities and measured at fair value. 


Our pre- versus post-SFAS No. 133 analysis is designed to test whether mandatory 
recognition of derivative instruments at fair value, as required by SFAS No. 133, has had 
the desired impact. 

We use both levels and changes models to examine the value-relevance of derivative 
financial instruments, whose fair values are disclosed prior to the adoption of SFAS No. 
133 and recognized subsequent to the adoption of SFAS No. 133. While these models are 
generally similar to the models in Equations (1) and (2), they differ from those models in 
two important ways. First, the coefficients on the explanatory variables are permitted to 
differ across the pre- and post-SFAS No. 133 periods. Second, instead of including both 
recognized and disclosed derivative fair values as independent variables, these models in- 
clude only disclosed derivative fair values in the pre-SFAS No. 133 period and only rec- 
ognized derivative fair values in the post-SFAS No. 133 period. In both он derivative 
fair values are estimated following SFAS No. 107.? 

The levels and changes models are specified as follows: 
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9 SFAS Мо. 133 (in paragraoh 243) states: “This Statement relies on the valuation guidance provided in Statement 
No. 107, which most entities have been employing for several years." 


The Accounting Review, May 2006 


574 | Ahmed, Kilic, and Lobo 



































a = es » бы, + 5 би, ВИС, + 8, MVE 
+ 8,4POST * MVE QE. d + 5, РОЗТ * МЕ 
a = А. + 8,,POST * iE. + By, we (4) 
+ 8,,POST + му + 8, й + &POST + ADER, 
+ ва иу so, REN 
+ 5, POST * ACORES + 
и 


where DER denotes net fair value of disclosed derivative financial instruments (ОМЮР) 
prior to the adoption of SFAS No. 133 and net fair value of recognized derivative financial 
instruments (RNDP) after the adoption of SFAS No. 133; T is the number of years in the 
sample period; Y, equals 1 if the observation belongs to year t, and 0 otherwise; К is the 
number of bank holding companies in the sample; and POST equals 1 if the observation 
belongs to the post-SFAS No. 133 period, and 0 otherwise. In addition to DER, we also 
interact FVFA, FVFL, NFA, NFL, NPL, and CORE with POST to allow the coefficients on 
these variables to vary across рге- and post-SFAS No. 133 periods. 

Note that moving from a disclosure regime to a recognition regime may affect the 
relative valuation of disclosed versus recognized derivatives through the effect of recogni- 
tion on capital adequacy ratios. However, these effects are unlikely to be significant if 
recognized derivatives are used to hedge assets or liabilities that are marked to market 
because any potential effects of recognized derivatives on Tier 1 capital will be offset by 
the losses or gains on the underlying assets or liabilities. Further, even if recognition results 
in increasing the volatility of capital ratios, the higher risk would bias against finding higher 
valuation coefficients for recognized items (Hodder et al. 2002). 


IV. EMPIRICAL ANALYSIS 
Sample Selection and Descriptive Statistics 


The sample used for our tests comparing recognized and disclosed derivative instru- 
ments in the pre-SFAS No. 133 period consists of bank holding companies that (1) use 
derivatives for nontrading purposes between 1995 and 2000, (2) are publicly traded, (3) 
have 10-K reports in LexisNexis, and (4) have annual observations in the Bank Regulatory 
Database from 1995 to 2000.!! The sample period is from 1995 to 2000 because both SFAS 
Nos. 107 and 119, which required expanded disclosures on fair values of derivative instru- 
ments and purposes of derivative usage, were effective during this period. This selection 
procedure yields a final sample of 146 bank holding companies. Of these, 58 bank holding 


10 See the Joint Release entitled “Interim Regulatory Reporting and Capital Guidance on SFAS No. 133, Accounting 
for Derivative Instruments and Hedging Activitles" issued by the OCC, Board of Governors of the Federal 
Reserve System, FDIC, and OTS dated December 29, 1998. 

п The Bank Regulatory Database provides accounting data for bank holding companies, commercial banks, savings 
banks, and savings and loans institutions. The Bank Regulatory Database collects financial data from the Federal 
Reserve Bank of Chicago. The data is based on bank holding companies’ FRY-9 reports. 
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companies have both recognized and disclosed derivative financial instruments held for risk 
management purposes, and 88 have only disclosed derivative financial instruments in the 
pre-SFAS No. 133 pericd. 

December 31 stock price information for the bank holding companies is obtained from 
CRSP.” Fair value information for financial assets and liabilities including derivatives and 
notional value information for derivatives is collected from banks' 10-K reports in 
LexisNexis. We confirm these notional and fair value numbers with the FRY-9 reports in 
the Bank Regulatory Database. 

Table 1 reports descriptive statistics for bank holding companies that have both rec- 
ognized and disclosed risk management derivative financial instruments, referred to as the 
first group, and bank holding companies that only have disclosed derivatives, referred to as 
the second group. 

A comparison of the first and the second group reveals that banks in the first group 
are considerably larger in terms of total assets than banks in the second group, and net 
derivative positions are economically significant at least for some banks. The net fair value 
of recognized derivatives ranges from — 7.44 percent to 7.35 percent of book value of equity 
and net fair value of disclosed derivatives ranges from —11.02 percent to 14.59 percent of 
book value of equity for the first group, while net fair value of disclosed derivatives ranges 
from —18.31 percent to 23.76 percent of book value of equity for the second group. Further, 
for the first group, net fair value of disclosed derivatives ($57.35 million) is significantly 
higher, on average, than the net fair value of recognized derivatives ($3.24 million). The 
two groups of banks are similar in terms of the composition of their risk management 
derivative portfolio. For both groups, interest rate derivative contracts constitute the major 
portion of the overall risk management derivative portfolio. For the first group, in terms of 
notional amounts, interest rate derivative contracts constitute approximately 90 percent of 
the overall portfolio, while this figure is 91 percent for the second group. Descriptive 
statistics in Panel A of Table 1 also reveal that the compositions of the recognized and 
disclosed derivative instrument portfolios are similar. Interest rate contracts constitute 84.77 
percent of the total notional amount of derivative contracts for recognized derivative in- 
struments, and 90.07 percent for disclosed derivative instruments. 

The sample for our second test, comparison of valuation of derivative instruments in 
the post-SFAS No. 133 period relative to valuation in the pre-SFAS No. 133 period, consists 
of 82 out of 88 bank holding companies, which have only disclosed derivative financial 
instruments before the adoption of SFAS No. 133 and which have the required data in the 
post-SFAS No. 133 period.'? Our sample period is from 1995 to 2000 before the adoption 
of SFAS No. 133, and from 2001 to 2004 after the adoption. Because these 82 banks 
constitute approximately 93 percent of the second group of banks described in this section, 
their descriptive statistics are similar to those of the second group and, for this reason, are 
not reported separately. | 

When comparing valuation coefficients on derivative instruments before and after the 
adoption of SFAS No. 133, we need to rule out alternative explanations for the difference 
other than recognition versus disclosure. Considering that requirements of SFAS No. 133 
have drastically changed the accounting for derivative instruments, there may have been a 
significant change in the extent of derivative usage or in the way derivatives are used. 


12 Using March 31 prices or returns cumulated to March 31 does not alter our inferences. 
13 Six bank holding companies are excluded from the pre- versus post-SFAS Мо. 133 analysis because they either 
merged with or were acquired by other bank holding companies. 
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Panel B of Table 1 presents descriptive statistics regarding the extent of derivative usage 
and the types of derivatives used. Our tests reveal that there is no significant change in the 
extent of derivative usage and composition of the derivative instrument portfolio. In terms 
of notional amounts, interest rate derivatives constitute a major portion of the entire port- 
folio (93.34 percent and 94.21 percent of total notional amount of derivative contracts 
before and after SFAS No. 133, respectively). The descriptive statistics also suggest that 
there is no significant change in the extent of derivative usage after SFAS No. 133 relative 
to before SFAS No. 133. The total notional amount of derivative contracts equals 9.21 
percent (58.92 percent) and 8.88 percent (63.08 percent) of total assets (market value of 
equity) before and after SFAS No. 133, respectively. These statistics suggest that any dif- 
ference in the valuation of derivative financial instruments before and after SFAS No. 133 
is unlikely to be attributable to change in the extent of derivative usage or composition of 
the derivative instrument portfolio. 


Value-Relevance Tests of Recognition and Disclosure for Pre-SFAS No. 133 Period 

Panel A of Table 2 presents estimation results for the cross-sectional levels model 
presented in Equation (1). These results are obtained using bank holding companies with 
both recognized and disclosed risk management derivative instruments. In order to control 
for heteroscedasticity and autocorrelation, we report t-statistics based on Newey and West’s 
(1987) heteroscedasticity and autocorrelation corrected covariance estimates in the remain- 
ing tables.” 

The results suggest that net fair value of recognized derivatives (RNDP) has significant 
explanatory power and is value-relevant. The coefficient оп RNDP, аз, is positive, as ex- 
pected, and significantly different from zero (t-statistic = 3.03). The coefficient on DNDP, 
as is not statistically different from zero at conventional levels (t-statistic = 0.91). This 
suggests that disclosed derivative fair values of banks have no power in explaining cross- 
sectional variation in banks’ market values of equity. The coefficients on other balance sheet 
variables (FVFA, FVFL, NFA, and NFL) have the predicted signs and are significantly 
different from zero. The coefficients on the control variables (VPL and CORE) also have 
the predicted signs though only the coefficient on CORE is reliably different from zero. 
We also test whether the coefficient on RNDP, а, is statistically greater than the coefficient 
on DNDP, as. Our findings suggest that a, is greater than a, at the 1 percent level, sug- 
gesting that the market places a lower weight on disclosed derivatives. 

Panel B of Table 2 presents the results for the changes model for bank holding com- 
panies that have both recognized and disclosed derivatives. The results of the changes model 
are not discussed in detail because the changes model provides results very similar to the 
levels model. 

A potential explanation for not finding a significant coefficient on DNDP is that the 
tests lack power because of the relatively small sample size. To assess this, we use a 
combined sample that includes banks with both recognized and disclosed derivatives and 
banks with only disclosed derivatives prior to SFAS No. 133 and test the significance of 
the valuation coefficients. In untabulated results, consistent with the results in Table 2, we 
find that the valuation coefficients on recognized derivatives are significant, whereas the 
valuation coefficients on disclosed derivatives are not significant. 


14 We use the Newey-West fixed bandwidth, q, in our tests (Newey and West 1987). It is based on the number of 
observations in the sample, and is computed as: а = int [4(N/100)?/*], where int Г | denotes the integer part 
and N is the number of observations. 
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TABLE 2 
Tests of Value-Relevance of Recognized versus Disclosed Derivative Instrument Fair Values in 
the Pre-SFAS No. 133 Period for the Sample of Banks with Both Recognized and Disclosed 


Derivative Instruments 
Panel A: Levels Модем 
Predicted Coefficient 
Variable™ Sign Estimates t-statistic? p-value* 
FVFA T 1.379 4.321 0.001 
FVFL - — 1.375 —4.219 0.001 
NFA + 1.338 3.996 0.001 
NFL 2 -1.293 -4.103 0.001 
RNDP + 1.270 3.025 0.001 
ЮМОР + 0.635 0.905 0.183 
NPL Е —0.459 —0.781 0.218 
CORE + 0.476 5.086 0.001 
Adj. В? 0.882 
No. of Obs. | 342 
Panel B: Changes Model! . 
Predicted Coefficient 
Variable^ — — _ Sign | Estimates t-statistic" p-value? 
AFVFA + 0.678 3.621 0.001 
AFVFL - —0.667 —3.333 0.001 
АМЕА + 0.686 3.801 0.001 
ANFL - -0.668 -3.603 0.001 
ARNDP + 0.854 2.661 0.004 
ADNDP + 0.528 0.896 0.174 
ANPL = —0.462 —2.223 0.014 
ACORE + 0.340 1.417 0.079 
Adjusted R? 0.375 
No. of Observations 284 


“The sample includes annual observations from 1995 to 2000 for banks that have both recognized and disclosed 

з For bank i at time г, MVE, = market value of equity; БУКА, = aggregate fair value of financial assets 
excluding risk management derivative instruments; FVFL, = aggregate fair value of financial liabilities : 
excluding risk management derivative instruments; NFA, = aggregate book value of nonfinancial assets; NFL, 
= aggregate book value of nonfinancial liabilities; ЮМОР, = net fair value of recognized derivative 
instruments; DNDP,, = net fair value of disclosed derivative instruments; NPL, = nonperforming loans; 
CORE, = domestic deposits minus time deposits; CSO, = number of common shares outstanding; Y, = 1 if 
the observation belongs to year і = T for Т = 1, 2,...6, and 0 otherwise; and BHC, = 1 if the observation 
belongs to bank holding company і = К for = 1, 2,...,58, and 0 otherwise. 

° The F-statistic for testing the hypothesis that борц = богд = ... Qos = 0 and the F-statistic for testing the 
hypothesis that оо, = ооз = ... ору = О indicate that the null hypothesis can be rejected at the 1 percent 
level. ` 

3 All t-statistics are based on Newey and West’s (1987) heteroscedasticity and autocorrelation consistent standard 
error estimates. 1 

° АП p-values are based on one-tailed t-tests when the coefficient sign is predicted, and based on two-tailed 
t-tests otherwise. 


(continued on next page) 
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TABLE 2 (Continued) 


f The sample includes annual observations from 1995 to 2000 for banks that have both recognized and disclosed 

є For bank i during year t, AMVE, = change in market value of equity; АРУҒА, = change in aggregate fair 
value of financial assets excluding risk management derivative instruments; AFVFL, = change in aggregate fair 
value of financial liabilities excluding risk management derivative instruments; ARNDP, = change in net fair 
value of recognized derivative instruments; ADNDP, = change in net fair value of disclosed derivative 
instruments; ANPL, = change in nonperforming loans; АСОКЕ, = change in domestic deposits minus time 
deposits; Y, = 1 if the observation belongs to year t = Т for T = 1, 2,...,5, and 0 otherwise; and BHC, = 1 if 
the observation belongs to bank holding company i = К for К = 1, 2,...,58, and 0 otherwise. 

^ The F-statistic for testing the hypothesis that Ву; = Богд = ~ = Bors = 0 and the F-statistic for testing the 
hypothesis that Po = Boa =... = Bos = 0 indicate that the null hypothesis can be rejected at the 1 percent 
eve 


To summarize, our results indicate that, in the pre-SFAS No. 133 period, recognized 
fair values of risk management derivative instruments are reliably relevant in explaining 
equity valuation of bank holding companies, whereas disclosed fair values of risk manage- 
ment derivative instruments are not.? 


Value-Relevance Tests of Recognition and Disclosure for Pre- versus Post-SFAS 
No. 133 Periods 


The analysis in this section uses the banks that have only disclosed derivatives in the 
pre-SFAS No. 133 period. Panel A of Table 3 presents estimation results of the levels model 
presented in Equation (3). These results suggest that, for the 82 bank holding companies 
in our sample, the coefficient on net fair value of derivative financial instruments disclosed 
prior to SFAS No. 133 (A) is not significantly related to market value of equity (t-statistic 
= 0.86). On the other hand, net fair value of derivative financial instruments recog- 
nized after SFAS No. 133 is significantly. positively related to market value of equity (t- 
statistic for №; + А, = 2.18). The positive and significant coefficient on POST*DER 
(t-statistic = 4.93), A, suggests that derivative fair values, which were disclosed prior to 
SFAS No. 133, are given more weight in valuation after SFAS No. 133, which mandates 
their recognition. 

The coefficients on the other balance sheet and control variables exhibit the predicted 
signs and are significant at conventional levels. The coefficients on interaction terms, except 
for POST*DER, are not reliably different from zero, suggesting that other balance sheet 
and control variables are given the same weight in valuation in both the pre- and the post- 
SFAS No. 133 periods. 

The estimation results for the changes models, presented in Panel B of Table 3, are 
consistent with the results for the levels models. They show that the coefficient on changes 
in net fair values of derivative financial instruments disclosed prior to SFAS No. 133, 6,, 
is not significant in explaining changes in banks’ equity values (t-statistic = 0.45), whereas 
the coefficient on changes in net fair values of derivative financial instruments recognized 
after SFAS No. 133, $, + 8, is significant (t-statistic = 1.92). The coefficient on 
POST*ADER, Хо is positive and statistically significant (t-statistic = 3.11), suggesting that 
the market places more weight on derivative fair values when they are recognized. 


15 These findings are in contrast to Venkatachalam (1996), who finds that derivative fair value disclosures are value 
relevant, but are consistent with Barth et al. (1996), Eccher et ‘al. (1996), and Nelson (1996), who report that 
fair values of banks’ off-balance-sheet derivative financial instruments are not significantly related to equity 
values. 
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TABLE 3 | 
Tests of Value-Relevance of Derivative Instrument Fair Values in the Post- and 
Pre-SFAS No. 133 Period for the Sample of Banks with Only Disclosed Derivative 
Financial Instruments in the Pre-SFAS No. 133 Period 


Panel A: Levels Model* 


Predicted Coefficient 
Variable" — _ Sign. Estimates t-statistic* p-value* 
FVFA + 1.266 5.444 0.001 
POST*FVFA ? —0.054 —0.193 0.847 
FVFL - -1.238 -5.212 0.001 
POST*FVFL ? 0.026 0.101 0.920 
NFA + 1.004 5.224 0.001 
POST*NFA ? —0.007 —0.910 0.363 
NFL - -0.961 -5.303 0.001 
POST*NFL ? 0.004 1.223 0.222 
DER + 0.673 0.863 0.194 
POST*DER ? 0.787 4.928 0.001 
NPL - —0.383 —4.676 0.001 
POST*NPL ? 0.179 1.263 0.207 
CORE t 0.135 2.359 0.009 
POST*CORE ? -0.046 —0.402 0.688 
Adjusted R? 0.853 
No. of Obs. 820 
t (As + А) + 2181 ` 0.015 
Panel B: Changes Model‘ 

Predicted Coefficient 

Variable?” Sign Estimates t-statistic? p-value" 
AFVFA + 0.618 3.099 0.001 
POST*AFVFA ? —0.134 —0.936 0.350 
AFVFL - —0.579 —3.802 0.001 
POST*AFVFL ? 0.149 0.888 0.375 
ANFA T 0.754 2.554 0.005 
POST*ANFA ? —0.076 —2.718 0.007 
ANFL Е —0.700 —2.301 0.011 
POST*ANFL 7 0.057 2.277 0.023 
ADER + 0.256 0.453 0.325 
POST*ADER ? 0.534 3.112 0.002 
ANPL - -0.110 -0.850 0.198 
POST*ANPL ? 0.079 0.776 0.438 
ACORE + 0.121 1.320 0.094 
POST*ACORE ? 0.153 0.691 0.490 
Adjusted R? 0.411 
No. of Obs. 656 
t (8; + 89 + 1.920 0.028 


(continued on next page) 
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* The sample includes annual observations from 1995 to 2004. 

* For bank і at time t, MVE, = market value of equity; FVFA, = aggregate fair value of financial assets 
excluding risk management derivative instruments; FVFL, = aggregate fair value of financial liabilities 
excluding risk management derivative instruments; NFA, = aggregate book value of nonfinancial assets; NFL, 
= aggregate book value of nonfinancial liabilities; DER, = net fair value of disclosed derivative instruments if 
the observation belongs to the pre-SFAS No. 133 period, or — net fair value of recognized derivative 
instruments if the observation belongs to the post-SFAS No. 133 period; NPL, = nonperforming loans; CORE, 
= domestic deposits minus time deposits; CSO, = number of common shares outstanding; Y, = 1 if the 
observation belongs to year г = T for T = 1, 2,...,9, and 0 otherwise; BHC, = 1 if the observation belongs to 
bank holding company i = К for К = 1, 2,...,82, and 0 otherwise; and POST = 1 if the observation belongs to 
post-SFAS No. 133 period, and 0 otherwise. 

еш зориро ee Ка = ... Хард = 0 and the F-statistic for testing the 
hypothesis that Ма = Маг = Маз = 0 indicates that the null hypothesis can be rejected at the 1 percent 
level. 

* All t-statistics are based on Newey and West's (1987) heteroscedasticity and autocorrelation consistent standard 
error estimates. 

* All p-values are based on one-tailed t-tests when the coefficient sign is predicted, and based on two-tailed 
t-tests otherwise. 

f The sample includes annual observations from 1995 to 2004. 

£ Variable definitions: For bank i during year г, AMVE, = change in market value of equity; АҒУҒА, = change 
in aggregate fair value of financial assets excluding risk management derivative instruments; AFVFL, — change 
in aggregate fair value of financialiliabilities excluding risk management derivative instruments, or ADER, 

— change in net fair value of disclosed derivative instruments if the observation belongs to the pre-SFAS No. 
133 period; — change in net fair value of recognized derivative instruments if the observation belongs to the 
post-SFAS No. 133 period; ANPL, = change in nonperforming loans; ACORE, = change in domestic deposits 
minus time deposits; Y, = 1 if the observation belongs to year t = Т for T = 1, 2,...,7, and 0 otherwise; BHC, 
= 1 if the observation belongs to-bank holding company i = К for К = 1, 2,...,82, and 0 otherwise; and POST 
= ] if the observation belongs to post-SFAS No. 133 period, and 0 otherwise. 


З The F-statistic for testing the hypothesis that ё, = 8012 = 807 = 0 and the F-statistic for testing the 
hypothesis that 8; = 85; =... = б = 01 indicate that the null hypothesis can be rejected at the 1 percent 
level. 


The coefficients on the other balance sheet variables have the predicted signs and are 
significant at.conventional levels. The coefficients on two interaction terms, POST*ANFA 
and POST*ANFL, are significant, suggesting that nonfinancial balance sheet accounts are 
not given the same weight in valuation in the pre- and the post-SFAS No. 133 periods. 
Unlike the results for the levels specification, the coefficients on the control variables, ANFL 
and ACORE, are insignificant (t-statistic = —0.85), and marginally significant (t-statistic 
= 1.32), respectively. 

Our analysis in this section may be difficult to interpret if banks change their risk 
management strategies significantly with the adoption of SFAS No. 133. Specifically, a 
change, from. Фе pre-SFAS No. 133 to the post-SFAS No. 133 period, in how risk man- 
agement derivatives are used might cloud our findings. While there has been speculation 
on whether and how SFAS No. 133 may affect firms’ risk management strategies, empirical 
evidence shows no significant effect on banks.! According to a survey by the Association 
for Financial Professionals (AFP 2002) over 70 percent of the respondents state that SFAS 
No. 133 has had no impact on the way they use derivative instruments. Results from a 
similar survey conducted by Treasury & Risk Management (2001) also indicate that almost 


1$ For example, in his article in CFO Magazine, Radigan (2001) says, "banking companies that already have 
rigorous accounting standards in place are finding life under SFAS No. 133 may not be that different from what 
came before." 
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70 percent of the derivative-users stated that their derivatives strategies have not changed 
in light of SFAS No. 133. 

We conduct two tests to examine whether banks € their bedging strategies as 
they moved from a disclosure environment to a recognition environment. For derivatives 
held for risk management purposes, gains and losses on these contracts are expected to be 
negatively associated with gains and losses on hedged assets and liabilities. Following 
Venkatachalam (1996), we test the existence of such a relation for the sample of banks 
used in the pre- versus post-SFAS No. 133 analysis. We analyze the correlation coefficients 
between fair value gains and losses on risk management derivatives (GLDER) and fair value 
gains and losses on-balance-sheet assets and liabilities (GLBS). Our definition of on- 
balance-sheet assets and liabilities includes investment securities, net loans, deposits, and 
long-term debt. These financial assets and liabilities are inherently exposed to interest 
rate and exchange rate risk, thus constituting most commonly hedged balance sheet items 
(Venkatachalam 1996). Fair value gains and losses are computed as the.sum of realized 
gains and losses and changes in unrealized gains and losses (Barth 1994; Venkatachalam 
1996). Our analysis (not reported) reveals that the correlations between GLDER and GLBS 
are —0.33 and —0.30 in the pre-SFAS No. 133 and the post-SFAS No. 133 periods, re- 
spectively. The negative correlations suggest that the derivatives we examine are, in aggre- 
gate, used for offsetting risk exposures inherent in on-balance-sheet financial assets and 
liabilities both prior to and after SFAS No. 133. 

In addition to analyzing the direction of the relationship, we also test whether the 
magnitude of the correlation changed from the pre-SFAS No. 133 to the:post-SFAS No. 
133 period. Using the procedure developed by Fisher (1921), we test whether the correlation 
between GLDER and GLBS during the pre-SFAS No. 133 period is different from the 
correlation during the post-SFAS No. 133 period. We find that the correlations are not 
different (z-statistic = —0.44, and p-value = 0.66) across the two periods, suggesting that 
the banks in our sample are unlikely to have changed their hedging strategies following 
SFAS No. 133. 


Self-Selection Correction 

A potential problem with our tests based on the sample of banks that held both rec- 
ognized and disclosed derivatives or banks that held only disclosed derivatives pre-SFAS 
No. 133 is that banks could self-select into either group. We control for this self-selection 
bias using the two-stage estimation procedure developed by Heckman (1976, 1979) and 
Lee (1976, 1978). We estimate the following probit model to explain the.classification of 
banks into the two groups: 


К, = ф + ф,АЕЅ, + pa SIZE, + b,COD, + PCAP, + $ ММ, 
+ VJRREXP, + в, (5) 


where R,, equals 1 if bank i holds risk management derivative financial instruments rec- 
ognized at fair value, and 0 otherwise; AFS, is fair value of available-for:sale securities 
scaled by total assets; SIZE, is natural logarithm of total assets; COD, is interest expense 
on short- and long-term debt, subordinated notes, and debentures scaled by the sum of debt, 
subordinated notes, and debentures; CAP, is equity capital scaled by total assets; ММ, is 
net interest income scaled by total assets; and IRREXP, is one-year maturity. gap scaled by 
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total assets. These variables proxy for the likelihood of banks holding recognized deriva- 
Чуез.!7 The probit model classifies*82.46 percent (79.92 percent) of the observations cor- 
rectly for the first (second) group, indicating that the probit model has high classificatory 
power.!* In untabulated results, we find that including the inverse-Mills ratio in our levels 
and changes models does not alter any of our inferences. 


Additional Sensitivity Analyses 

We perform a number of additional tests and sensitivity analyses. First, we repeat our 
tests using annual year-by-year regressions. Although we only have a small number of 
annual observations, we find the coefficient on RNDP to be statistically significant in five 
of the six years (1995, 1996, 1997, 1999, 2000), and the coefficient on DNDP to be insig- 
nificant in all six years. Second, to reduce the potential bias in coefficient standard error 
estimates that may result from cross-sectional correlation, we compute a t-statistic from the 
regression coefficient estimates obtained from annual regressions (Bernard 1987). Using 
this alternative t-statistic does not alter any of our inferences. Third, for both the pre-SFAS' 
No. 133 and the pre- versus post-SFAS No. 133 analysis, we examine the sensitivity of our 
results to the choice of deflator by repeating levels estimations using variables scaled by 
beginning book value of equity in place of number of shares outstanding. We obtain similar 
results as those reported except that the coefficient on recognized derivative fair values 
following SFAS No. 133, A, + Хе, is also not different from zero. However, we find that 
recognized derivatives are significantly valued when the post-SFAS No. 133 period is de- 
fined as the period 2002—2004. Finally, to rule out the possibility that our pre- versus post- 
SFAS No. 133 results are driven by an increase in value-relevance of recognized derivatives 
for some reason other than recognition versus disclosure, we conducted a pre- versus post- 
SFAS No. 133 analysis for the banks that had both recognized and disclosed derivatives 
prior to SFAS No. 133. Our results (not reported) show that value-relevance of recognized 
derivatives has not changed from the pre- to the post-SFAS No. 133 period. 


Possible Differences in Reliability 

While our findings are consistent with recent limited attention or costly information- 
processing models, an alternative explanation for our findings is that there are differences 
in the reliability of recognized and disclosed derivatives. However, two factors might mit- 
igate the reliability differences between recognized and disclosed fair values of derivatives. 

First, all derivative contracts are traded either on exchanges, or in over-the-counter 
(OTC) markets. For exchange-traded derivatives, price verification is straightforward as they 
are traded in highly liquid markets. Although OTC markets are not as liquid as exchanges, 
they operate in a very similar manner to exchanges. Dodd (2002) notes that in all three 
types of OTC markets (traditional dealer markets, electronically brokered markets, and 
proprietary electronic dealer platforms), the trading process is either bilateral or multilateral, 
and the ability to instantaneously communicate among dealers and customers enables mar- 
ket participants to execute trades in a multilateral environment similar to exchanges as 
defined by the Commodity Exchange Act. Smith and Wilson (1996) also state that interest 
rate swaps are traded enough to obtain quotes from dealers. For these reasons, although 


U We expect the probability of holding recognized derivatives to be increasing with the amount of available-for- 
sale securities, bank size (Angbazo 1997), cost of external financing (Froot and Stein 1998), inherent interest 
rate risk exposure (Carter and Sinkey 2000; Gunther and Siems 2002; Reichert and Shyu 2002), and decreasing 
with capital adequacy ratio and bank performance (Ahmed et al. 1999). 

18 Consistent with prior studies (e.g., Chaney et al. 2004), we assess the classificatory power of the first stage 
probit model using a 0.50 cutoff. 
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not as liquid as exchanges, OTC markets are still considered liquid for derivative PUES 
such as interest rate swaps.? 

Second, observable market inputs are the major determinants of the fair vifus of 
derivatives traded in the OTC market. The Public Company Accounting Oversight Board 
(2004) states that verification of reliability of a fair value measurement depends on the 
extent to which market-based inputs, which are easily verified, and entity-specific assump- 
tions, which are the results of unobservable management expectations, are used in calcu- 
lations. Fair values of derivative contracts including interest rate swaps and foreign 
exchange forwards, which constitute over 90 percent of banks' derivatives portfolios (Office 
of the Comptroller of the Currency 2003), depend mostly on market-based inputs including 
rates, prices, and indices to generate continuous yield or pricing curves. The pricing of 
these derivatives also includes entity-specific factors, mainly counter-party default risk, to 
some extent. However, entity-specific information plays a relatively smaller role in pricing 
derivatives (Litzenberger 1992). 

While these factors mitigate reliability differences between recognized and disclosed 
derivative fair values, we cannot definitively rule out differences in perceived reliability as 
an alternative explanation because there is some evidence that auditors might impose higher 
standards for recognized information than for disclosed information (Libby et al. 2005). 


V. CONCLUSION 

This study investigates the value-relevance of banks' derivative financial instruments 
held for risk management purposes that are recognized in financial statements at fair value 
and those that are not recognized, but whose fair values are disclosed in notes to the 
financial statements. Expanded disclosures and accounting practices prior to and after SFAS 
No. 133 provide a natural setting for comparing the valuation implications of recognized 
and disclosed derivative fair value information. 

Using a sample of banks that simultaneously hold recognized and disclosed derivatives 
prior to SFAS No. 133, we find that valuation coefficients on the recognized derivatives are 
significantly positive, whereas valuation coefficients on the disclosed derivatives are not 
significantly different from zero. Further, using a sample of banks that have only disclosed 
derivatives prior to SFAS No. 133, which are recognized after SFAS No. 133, we find that 
while the valuation coefficients on disclosed derivatives were not significant, the valuation 
coefficients on recognized derivatives (after SFAS No. 133) are significantly positive. 

Our findings suggest that SFAS No. 133 has been successful in increasing the trans- 
parency and visibility of derivative financial instruments. Our evidence is also consistent 
with conjectures of practitioners as well as recent models of limited attention or costly 
information processing that suggest that investors do not pay an equal amount of attention 
to disclosed amounts relative to recognized amounts. Reliability differences between rec- 
ognized and disclosed derivative fair values might be an alternative explanation for our 
findings. 

Our results are in contrast to the prevalent implicit assumption in empirical accounting 
research that whether an item is recognized or disclosed does not matter in terms of value- 
relevance and suggest that recognition and disclosure are not substitutes. Finally, our results 
also suggest that researchers should control for potentially different pricing implications of 
recognized and disclosed information in their studies. 


19 Recent empirical studies in finance also show that the OTC derivatives markets are informationally efficient. For 
example, Gupta and Subrehmanyam (2000) document evidence of mispricing of swaps during the 1980s and 
early 1990s, and find that the mispricing is eliminated with increased liquidity starting from the mid-1990s. 
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| APPENDIX 
Disclosures from Bank Annual Reports about How Banks Define Their Trading and 
Non-Trading Derivative Activities 


From the 2000 Annual Report of Keycorp, Inc.: 


Key engages in capital market, primarily through the Key Capital Partners line of 
business. These activities encompass a variety of services. Among other things, we trade 
securities as a dealer, enter into derivative contracts (both to accommodate clients’ fi- 
nancing needs and for proprietary trading purposes), invest in privately held companies 
and conduct transactions in foreign currencies (both to accommodate clients’ needs and to 
benefit from fluctuations in exchange rates). 


DERIVATIVES USED FOR ASSET AND LIABILITY MANAGEMENT PURPOSES 
Key uses derivatives known as interest rate swaps and caps to manage interest rate risk. 
These instruments modify the repricing or maturity characteristics of specified on- 
balance-sheet assets and liabilities. For example, an interest rate cap tied to variable rate 
debt would effectively prevent the interest rate on that debt from rising above a specified 
point. . 
(paragraph omitted) 


DERIVATIVES USED FOR TRADING PURPOSES 

Derivatives that are not used for asset and liability management purposes are used for 
trading purposes. Key enters into contracts for such derivatives either to make a market 
for clients or for proprietary trading purposes. Derivatives used for trading purposes 
typically include financial futures, foreign exchange forward and spot contracts, written and 
purchased options (including currency options), and interest rate swaps, caps, and floors. 
All derivatives used for trading purposes are recorded at fair value. 


From the 2000 Annual Report of Citigroup, Inc.: 


Citigroup enters into derivative and foreign exchange futures, forwards, options, and 
swaps, which enable customers to transfer, modify, or reduce their interest rate, foreign 
exchange, and other market risks, and also trades these products for its own account. 
In addition, Citigroup uses derivatives and other instruments, primarily interest rate prod- 
ucts, as an end-user in connection with its risk management activities. Derivatives are 
used to manage interest rate risk relating to specific groups of on-balance-sheet assets and 
liabilities, including investments, commercial and consumer loans, deposit liabilities, long- 
term debt, and other interest-sensitive assets and liabilities, as well as credit card securiti- 
zations, redemptions, and sales. In addition, foreign exchange contracts are used to hedge 
non-U.S. dollar denominated debt, net capital exposures, and foreign exchange transactions. 

(paragraph omitted) 

The following table provides data on the notional principal amounts and maturities of 
end-user (non-trading) derivatives, along with additional data on end-user interest rate 
swaps and net-purchased option positions at year-end 2000 with three-month LIBOR for- 
ward rates included for reference. 
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L INTRODUCTION 

his study investigates whether earnings generated by changes in the effective tax rate 
| (hereafter, the tax change component of earnings) are persistent for future earnings.! 
In addition, this study investigates to what extent investors incorporate the forecast- 
ing implications of the tax change component of earnings into stock prices. The tax change 
component of earnings is likely to have implications for earnings persistence for a number 
of reasons. First, income taxes are of such magnitude that small changes in the effective 
tax rate (ETR) can lead to substantial changes in earnings. Second, corporate managers are 
placing greater emphasis on ETRs as in-house tax departments are increasingly viewed as 
profit centers (U.S. Department of the Treasury 1999; McGill and Outslay 2004).? Finally, 
recent academic literature, as well as the financial press, has identified ETR manipulation 
as an earnings management tool. Dhaliwal et al. (2004) and Comprix et al. (2006) find that 
firms lower ETRs when earnings absent tax management fall short of analyst forecasts. 

Myers et al. (2005) indicate that managers use ETRs to smooth reported EPS. 

The tax change comporient of earnings captures the aggregate effect of ETR reconcil- 
iation items, including permanent book-tax differences. However, ETR reconciliation items 
increasingly reflect the effects of strategic tax-planning/tax-optimization activities, which 
are designed to provide sustainable long-term benefits to earnings. For example, manage- 
ment of Tyco urged shareholders to reject a proposal to move its parent company back to 
the U.S. from Bermuda, stating that the impact on the company's current and future earnings 
(from an estimated seven-percentage-point increase in its current and future ETR) could 
have caused a $4 to $5 billion decrease in market capitalization (McGill and Outslay 2004). 
Observers speculate that items like these are a mnie reason for the increase in book-tax 
differences in the 1990s (Reason 2002). 

The collapse of Enron, the rise of global tax optimization (planning), and the renewed 
use of corporate tax shelters has resulted in increased scrutiny of book-tax differences and 
their implications for earnings quality and financial performance. My study contributes to 
the growing literature that examines the persistence and pricing implications of book-tax 
differences by directly examining how taxes contribute to earnings prediction, which is 
consistent with the view that one task of accounting research is to discover what information 
in the financial statements projects future earnings (Penman 1992).? Understanding the 
differential persistence of the tax change component and its forecasting implications should 
aid market participants in profitability analysis and in income forecasting. 


! A subtle difference between this study and studies that examine taxes as one of several fundamental signals 
used in financial statement analysis (Lev and Thiagarajan 1993; Abarbanell and Bushee 1997, 1998; Amir et 
21.2000) is how the effective tax rate (ETR) and the tax change component is defined. 1 define Ше ETR as total 
tax expense divided by pretax income and the tax change component as pretax income multiplied by the change 
in the ETR. Lev and Thiagarajan (1993) use current tax expense only in the numerator of tbe ETR and Abarbanell 
and Bushee (1997, 1998) and Amir et al. (2000) use pretax income adjusted for amortization of intangibles in 
the denominator of the ETR. Additionally, Abarbanell and Bushee (1997, 1998) and Amir et al. (2000) interact 
the change in ETR with the change in after-tax EPS in order to determine the earnings effect of ETR changes, 
which differs from my definition of the tax change component. 

? Beginning in 1995, Enron's tax division used a series of structured transactions to increase company profits. 
The deals were intended to create $1.9 billion in earnings over 30 years (Witt and Behr 2002). 

3 This.approach differs from that of Guenther and Jones (2003), who study the relationship between contempo- 
raneous stock returns and annual ETR-related earnings changes in order to determine whether annual ETR- 
related earnings changes are value-relevant. An important assumption in Guenther and Jones (2003) is that the 
value-relevance of ETR-related earnings is indicative of future earnings or growth opportunities of the firm. 
Guenther and Jones (2003) do not test whether ETR-related earnings changes are associated with realized future 
earnings, which is the primary focus of this paper. 
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Prior literature either ignores or aggregates taxes with other earnings components (Sloan 
1996; Hribar 2001; Nissim and Penman 2001; Barth et al. 2004) or examines the impli- 
cations of.information contained in temporary book-tax differences (deferred tax expense) 
for the quality of pretax earnings (Hanlon 2005). An additional branch of research examines 
the informativeness of estimated taxable income to capital markets (Shevlin 2002; Hanlon 
et al. 2005; Lev and Nissim 2004). 

In contrast to these studies, my decomposition of earnings treats taxes like any other 
earnings component (1.е., cash flows, changes in accounts receivable, changes in inventory, 
etc.), which allows me to directly test a maintained hypothesis that earnings generated by 
ETR changes are transitory or attempts at aggressive accounting by management. 

I find that there is a positive, significant association between the tax change component 
of earnings and future earnings, which disputes the broad notion advanced by prior literature 
(Lev and Thiagarajan 1993; Abarbanell and Bushee 1997, 1998; Amir et al. 2000) that 
ETR-related earnings changes are completely transitory. In contrast to these earlier studies, 
as well as concurrent research by Guenther and Jones (2003), I use the interim reporting 
requirements of APB No. 28 (APB 1973) and FASB Interpretation No. 18 (FASB 1977) 
to further decompose the tax change component into an initial and a revised portion based 
on the first quarter estimate of the annual ETR. I find that the initial tax change component 
is more persistent for future earnings than the revised tax change component.. 

This study also complements concurrent research that suggests tax-related earnings 
information is priced by the market (Hanlon 2005; Gleason and Mills 2004; Guenther and 
Jones 2003; Lev and Nissim 2004) by directly testing whether investors appropriately in- 
corporate the implications of ETR-related earnings changes for future earnings into market 
prices. 

Results from market tests indicate that investors underweight the persistence of the tax 
change component and the mispricing appears to be driven by the transitory nature of the 
revised tax change component. Supplemental tests confirm that the results obtained from 
the market tests are likely a result of mispricing rather than alternative explanations such 
as omitted risk factors or previously documented anomalies. 

I organize the remainder of the paper as follows. Section П presents an overview of 
corporate income tax disclosures, emphasizing the issues surrounding the ETR reconcilia- 
tion and how current reporting requirements affect my empirical proxies. Section III de- 
velops hypotheses. Section IV outlines the research design; Section V presents results and 
supplemental analyses. Section IV concludes. 


II. CORPORATE INCOME TAX DISCLOSURES 

Effective Tax Rates 
Disclosure 

Recent accounting standards provide guidance regarding ETR disclosures. Statement 
of Financial Accounting Standards (SFAS) No. 96 (FASB 1987, 128) and No. 109 (FASB 
1992, 147), Accounting for Income Taxes, require companies to disclose a reconciliation of 
the amount of income tax expense that would result from applying the domestic federal 
statutory rate to pretax income from continuing operations to the reported amount of income 
tax expense attributable to continuing operations for the year. When a firm reconciles its 
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statutory rate to the ETR, it must report any item that results in a material difference 
between the statutory rate and the ETR.* 


Reconciliation Adjustments 

Permanent differences cause the ETR to deviate from the statutory rate. Permanent 
differences are caused by items that enter into pretax book income but never enter into 
taxable income and items that enter into taxable income but never into pretax book income.’ 
Common examples of permanent differences include: 


* [tems recognized in pretax book income but not in taxable income: Interest on tax- 
exempt municipal bonds and life insurance proceeds on an officer of the corporation, 
qualified stock options accounted for using the fair value method, penalties and fines, 
key employee life insurance premiums, and most goodwill and in-process research 
and development (IPRD) in acquisitions. 

* Items recognized in taxable income but not in pretax book income: Percentage de- 
pletion in excess of cost depletion and the dividends-received deduction. 


Additionally, certain items that are technically temporary differences have the same 
effect as a permanent difference because a deferred tax liability is not recognized for these 
items (SFAS No. 109 431а-4). One example is the undistributed earnings of а foreign 
subsidiary consolidated for book purposes that management intends to reinvest indefinitely." 

Tax rate adjustments also cause the ETR to deviate from the statutory rate. Tax rate 
adjustments are special tax rates or tax credits that affect the statutory rate. Tax rate ad- 
justments are not permanent or temporary differences as they do not affect pretax income 
or taxable income (Weber and Wheeler 1992) and include: 


* Tax credits (e.g., the research and development credit or the foreign tax credit). 
e Different tax rates on foreign income. 
* Tax expense that includes both state and local income taxes, net of federal tax benefit. 


Finally, the BTR reconciliation includes adjustments not only for the tax effects of 
pretax income from continuing operations, but also for the tax effects of other items attrib- 
utable to continuing operations, including: 


* If no individual reconciling item amounts to more than 5 percent of the amount computed by multiplying the 
income before tax by the applicable statutory Federal income tax rate, and the total difference to be reconciled 
is less than 5 percent of such computed amount, then no reconciliation need be provided unless it would be 
significant in appraising the trend of earnings. Reconciling items that are individually less than 5 percent of 
income tax expense may be aggregated in the reconciliation (SEC Regulation S-X §210.4-08(h)(2)). 

5 Permanent differences are defined in 133 of АРВ No. 11. However, the term “permanent difference" is not used 
in SFAS No. 109. Instead, SFAS No. 109 (114) states that certain differences between the tax basis and ac- 
counting basis of assets and liabilities will not result in taxable or deductible amounts in future years, therefore 
no deferred tax liability or asset should be recognized. 

§ Until 1993, goodwill was always a permanent difference; it was amortized for book purposes for a period not 
to exceed 40 years and was not deductible on the tax return. After 1993, goodwill may be amortized over 15 
years for tax purposes (IRC section 197) as long as the tax bases of the acquired firm's assets are stepped-up. 
A step-up occurs almost exclusively in subsidiary sales or acquisitions of conduits. In these cases, goodwill 
results in a temporary difference. Goodwill created through the acquisition of a freestanding C-corporation, 
where basis step-ups are rare, usually results in a permanent difference (Scholes et al. 2004). IPRD is written 
off immediately for financial reporting purposes. Similar to goodwill, IPRD is not tax deductible as long as no 

? Undistributed earnings of subsidiaries consolidated for financial reporting purposes are recognized in pretax 
book income currently whether such subsidiaries are foreign or domestic. These carnings would usually be 
recognized in taxable income when distributed, resulting in a temporary difference. However, if a foreign sub- 
sidiary intends to reinvest its undistributed earnings indefinitely, no deferred tax liability is recorded, which 
effectively treats the undistributed earnings as a permanent difference. 
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e deferred tax asset valuation allowance account adjustments; 

9 changes in tax laws or tax rates; 

* changes in the tax status of an enterprise (e.g., from a partnership to a corporation); 
and 

® tax loss carryforward benefits. 


The tax change component of earnings reflects the aggregate effect of ETR reconciling 
items; therefore, the extent to which the tax change component of earnings persists and 
aids in forecasting future earnings provides an indication of how ETR reconciliation items 
can be used in financial statement analysis. 


Interim Reporting Requirements 

GAAP mandates that the quarterly reporting period be viewed as an integral part of 
the annual reporting period. APB Opinion No. 28 (АРВ 1973), SFAS No. 3 (FASB 1974), 
and FASB Interpretation No. 18 (FASB 1977) require firms to estimate many annual op- 
erating expenses and then allocate these estimates to interim periods based on forecasted 
figures. As the fiscal year progresses, firms revise estimates and record estimation errors 
from earlier quarters (Rangan and Sloan 1998). 

Income taxes are one type of expense that is subject to the interim reporting require- 
ments. Firms compute quarterly tax expense by estimating the ETR that is expected to be 
applicable for the full fiscal year. The estimated annual ETR is then applied to the year-to- 
date ordinary income at the end of each interim period, defined as pretax income before 
extraordinary items, discontinued operations, and cumulative effects of changes in account- 
ing principles (FASB Interpretation No. 18, 1977, 99), to compute the year-to-date tax 
expense. The year-to-date tax expense of the prior quarter is then subtracted from the year- 
to-date tax expense of the current quarter to arrive at the current quarter's tax expense. This 
procedure is illustrated in the following footnote from Hewlett-Packard's Q3 2000 10-Q: 


HP's effective tax rate was 23% for the first nine months of fiscal 2000, reflecting the 
tax rate expected for the full fiscal year. The tax rate was 21.3% in the third quarter 
of 2000 due to a reduction in the annual effective tax rate from 2496 to 2396. In 1999, 
the full-year effective tax rate was 26%, and the rate was 26.5% in both the third 
quarter and first nine months of the year. The year-to-year decrease in HP's effective 
tax rate was primarily the result of changes in the mix of our pretax earnings in various 
tax jurisdictions throughout the world. 


I use the interim reporting requirements to decompose the tax change component into 
an initial and revised portion based on the first quarter estimate of the annual ETR. This 
decomposition allows me to develop and test hypotheses regarding the differential persist- 
ence and forecasting implications of the tax change component of earnings. 


Ш. RESEARCH QUESTIONS AND HYPOTHESES 
The Persistence and Forecasting Implications of the Tax Change Component 
of Earnings 
The first objective of my paper is to document whether the tax change component is 
persistent for future earnings. Variations of the tax change component have been studied 


8 The integral approach was selected over a discrete approach for interim reporting. Under the discrete а; 
each interim quarter is viewed as a separate accounting period, therefore interim accruals and deferrals should 
reflect the same principles employed for annual reports. The discrete approach applies the same expense rec- 
ognition principles to both interim and annual reports with no special interim accruals or deferrals allowed. 


The Accounting Review, May 2006 


594 Schmidt 


previously as part of a set of fundamental signals used: by financial analysts to predict future 
performance (Lev and Thiagarajan 1993; Abarbanell and Bushee 1997, 1998). The under- 
lying hypothesis in these papers is that analysts view earnings generated from ETR changes 
as transitory? Abarbanell and Bushee (1997) find that ETR-related earnings changes are 
less persistent for future earnings changes and long-term earnings growth than aggregate 
earnings changes from 1983-1990, but do not directly test (or report) the extent to which 
ETR-related earnings changes are associated with future earnings changes. Despite this, 
Abarbanell and Bushee (1997) state that the ETR variable captures more than transitory 
effects, possibly unidentified risk factors or structural. changes. 

The view that earnings generated from ETR changes contain some transitory elements 
is reasonable because the ETR can be used for period-specific earnings management (i.e., 
beating analyst forecasts, avoiding a loss, or avoiding a decrease in earnings). Recent re- 
search examines earnings management through aggregate ETR changes (Comprix et al. 
2006; Dhaliwal et al. 2004; Myers et al. 2005) and specific ETR reconciling items such as 
the DTVA (Miller and Skinner 1998; Visvanathan 1998; Bauman et al. 2001; Burgstahler 
et al. 2003; Frank and Rego 2003; Schrand and Wong 2003), permanently reinvested foreign 
earnings (Krull 2004), intangible assets (Nelson et al. 2003), and the tax cushion (Smith 
2001; Gleason and Mills 2002; Nelson et al. 2003). The managed portion of earnings could 
be transitory (or less persistent), resulting in a lower coefficient on managed earnings in 
predicting future earnings, compared to the coefficient on unmanaged earnings (Xie 2001; 
Joos et al. 2003; Hanlon 2005). 

However, some of the most prominent ETR reconciliation items, including foreign tax 
rate differentials and state taxes, likely reflect long-term (and therefore persistent) strategic 
tax-planning activities as well as contain forward-looking information that may shed light 
on the future prospects of the firm.!° Mills et al. (1998) show that there is a positive 
relationship between investments in tax planning and long-term strategy indicators such as 
the presence of foreign assets, number of entities, and capital intensity. Bodnar and 
Weintrop (1997) find that foreign operations provide greater growth opportunities than 
domestic operations and Thomas (2000) finds that foreign earnings changes are more useful 
than domestic earnings changes in predicting future earnings changes. If greater growth: 
opportunities lead to more persistent earnings (Collins and Kothari 1989), ETR-related 
earnings in foreign jurisdictions may be more permanent depending on the extent to which 
the company maintains or expands its operations in foreign jurisdictions. 

Further, state tax-planning activities have grown tremendously in the 1990s, in part due 
to fewer (and more costly) federal tax-planning opportunities, economic-development- 
oriented tax incentive competition among states, and more aggressive state tax compliance 
due to state fiscal problems. Gupta and Mills (2002) show that state ETRs decrease (at a 
decreasing rate) and then increase (at an increasing rate) as a function of the number of 
state tax returns filed by the firm. This information would be reflected in the ETR and 


9 This hypothesis, which originated in Lev and Thiagarajan (1993), likely comes from the following sources: (1) 
various accounts in O’Glove (1987, 72) where a decrease in the ETR is characterized as a nonrecurring/ 
nonoperating item, (2) a survey by Siegel (1982), which indicates that analysts, accountants, and financial 
managers think that an increase in earnings from the utilization of a net operating loss carryforward represents 
a deterioration of earnings quality, and (3) articles in publications such as the Wall Street Journal that discussed 
characteristics of company earnings releases. 

10 Strategic tax-planning activities also include ЕТК “optimization” strategies such as transferring intangibles 
offshore, transfer pricing, structuring low-tax contract manufacturing operations, using holding companies to 
minimize withholding taxes on dividends, and engaging in intercompany financing techniques like debt push- 
downs. 
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would increase the persistence of the tax change component of earnings through the extent 
to which the benefits of state tax planning result in permanent tax savings. 

The persistence of the tax change component could also be affected by the renewed 
use of tax shelters starting in the mid-1990s.!! While most tax shelters are thought to 
provide one-time benefits, evidence suggests that some tax shelters could have more per- 
sistent effects on earnings. For example, Graham and Tucker (2005) find that the average 
tax shelter used by firms in their sample was active for approximately five years. While 
most tax-sheltering activity is likely not as persistent as long-term strategic tax planning, 
there could be a positive effect on the persistence of the tax change component from tax 
sheltering to the extent that certain tax shelters provide benefits over multiple years. 

Since the tax change component. of earnings (as does any earnings component) has 
effects that are expected to be either persistent or transitory, whether the tax change com- 
ponent of earnings persists or aids in forecasting future earnings depends on the relative 
proportion of persistent and transitory effects. The previous discussion suggests that the tax 
change component of earnings primarily reflects a combination of long-term strategic tax 
planning (expected to be relatively persistent), tax-sheltering activities, and period-specific 
earnings management activities (both expected to be relatively transitory). This raises the 
following question: 


КО), : Is the tax change component of earnings persistent for future earnings (1.е., useful 
in forecasting future earnings incremental to earnings excluding the tax change)? 


The Persistence and Forecasting Implications of the Tax Change Component of 
Earnings in Interim Periods 

I next consider the persistence of the tax change component of earnings in interim 
periods. The results of prior research on the time-series properties of quarterly earnings are 
consistent with the notion that interim earnings become less persistent (more transitory) as 
the fiscal year progresses.!? Thus, even without considering aspects specific to tax expense, 
one might expect the tax change component early in the year to be more persistent than 
any revisions. 

However, the interim reporting requirements for income tax expense suggest additional 
arguments for the tax change component to be more or less persistent. APB No. 28 requires 
managers to make their best estimate of the firm’s ETR that is expected to be applicable 
for the full fiscal year when computing interim income tax expense. The estimated annual 
ETR should reflect anticipated tax credits, foreign tax rates, percentage depletion, capital 


11 Tax advisors and corporate directors regard a transaction with low visibility that results in a permanent difference 
as the perfect tax shelter (U.S. Department of Treasury 1999; Hanlon 2003; Boynton and Mills 2004; McGill 
and Outslay 2004). These shelters would affect the ETR and, thus, the tax change component of earnings. 

12 Collins et al. (1984) find that the forecast errors tend to increase as the fiscal year progresses, while Bathke and 
Lorek (1984) find that time-series forecast errors are larger in the fourth quarter. Salamon and Stober (1994) 
and Kross and Schroeder (1990) present evidence that earnings response coefficients are significantly smaller 
for fourth quarter earnings than for interim earnings, which is consistent in part with the existence of fiscal year- 
end discretionary accruals from earnings management. Lipe and Bernard (2000) find that the differential stock- 
price response to fourth quarter earnings is most likely due to increased recognition of transitory earnings in 
the fourth quarter, opposed to the traditional explanation of the annual audit correcting interim reporting errors. 
The Lipe and Bernard (2000) finding is consistent with Burgstahler et al. (2002), who find that special items, 
which proxy for transitory earnings, occur more frequently during quarters later in the fiscal year, with about 

. 41 percent occurring in the fourth quarter, and Livnat (2003), who finds that fourth quarter earnings have lower 
persistence than earnings from the first through third quarters and the decline in persistence is attributed mostly 
to extremely negative earnings surprises. 
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gains rates, and other available tax-planning alternatives (FASB Interpretation No. 18, 1977, 
18). 

Interviews with current and former tax partners suggest that it is likely that the items 
that have the most material (and persistent) effect on management's estimate of the annual 
ЕТК are anticipated and incorporated into the initial (Q1) annual ETR estimate.? For 
example, it may be that early in the year, managers make assumptions regarding the relative 
long-term strategic mix of income from various sources (i.e., domestic versus foreign earn- 
ings, taxable versus tax-advantaged income) and use these estimates (in part) to arrive at 
the initial annual estimate of the ETR. As period-specific information becomes available in 
subsequent quarters (Q2, Q3, Q4), managers may revise the annual ETR estimate for period- 
specific deviation in the income weights.!^!5 

Anecdotal evidence suggests that analysts often view revisions in estimated annual 
ETRs as attempts at aggressive accounting, and prior research finds that firms change ETRs 
to beat analysts' forecasts or to smooth earnings, consistent with earnings management 
(Dhaliwal et al. 2004; Comprix et al. 2006; Myers et al. 2005). However, it is ultimately 
an empirical question as to whether the initial (Q1) tax change component of earnings 
behaves differently than subsequent revisions (Q2, Q3, Q4) in the tax change component 
of earnings.!ó If tax-planning (earnings management) activities are more persistent (transi- 
tory) and if the initial tax change component reflects more tax-planning activities than 
earnings management activities, then the initial tax change component will be more per- 
sistent (or less transitory). Likewise, if the revised tax change component reflects more 
earnings management activities, then the revised tax change component will be less persis- 
tent. Stated formally: 


Н,: The initial (Q1) tax change component of earnings is more persistent for future 
earnings than subsequent revisions (Q2, Q3, Q4) in the tax change component of 
earnings. 


13 The partners stressed that managers are aware of the major permanent differences and other ETR reconciliation 
items when firms predict end-of-year book income. Furthermore, managers do not like surprises from the tax 
department, which often leads to conservative initial ETR estimates. When an ETR is changed after the initial 
estimate is made at the end of the first quarter (Q1), it is usually due to ane of the following reasons: (1) there 
is a surprise, such as an audit settlement that is better than what was previously anticipated; (2) new tax planning; 
(3) there is a return to reality (і.е., the initial conservative ETR estimates from Q1 are “trued up" as future 
earnings are realized); (4) earnings management, or (5) significant variations in pretax earnings (especially pretax 
earnings close to zero). Of these reasons, the partners agreed that new tax planning is least likely to occur. 

14 Additional insight into this phenomenon (albeit with different income sources) can be gained from the Q3 2000 
MD&A of Viad (2000): “Тһе relatively low ЕТЕ compared to the statutory federal rate is primarily attributable 
to tax-exempt income from Viad's Payment Services businesses. APB Opinion No. 28 requires that income 
taxes be provided based on the estimated ETR expected to be applicable for the entire fiscal year, with an 
adjustment of the annual ETR made each quarter. During the third quarter, Viad determined that the estimated 
annual ETR for 2000 is expected to be lower than in prior periods due to higher than previously expected tax- 
exempt income in proportion to total pretax income, resulting from rapid growth in investments in tax-exempt 
securities in the Payment Services segment along with lower operating income in the Convention and Event 
Services Segment. Accordingly, the adjustment of the ETR was made in the third quarter of 2000." 

15 This could also be an additional way the ETR is managed. In fact, when companies make interim ETR revisions, 
one of the most common explanations for the revision is a change in the estimated mix of income from foreign 
and domestic sources (for example, see the Q3 2000 10-Q of Hewlett-Packard, a part of which is presented in 
Section II). 

1$ Unless a tax shelter provides benefits for multiple years, it is unlikely that management's estimate of the annual 
ETR will reflect the tax benefits from the shelter; therefore, it is not likely that this type of activity will affect 
the persistence of the initial and revised tax change component in a different manner. 
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Valuation Implications of the Tax Change Component of Earnings 

My final objective is to determine if market participants recognize the forecasting im- 
plications of the tax change component. Until recently, the empirical evidence (Lipe 1986; 
Lev and Thiagarajan 1993; Abarbanell and Bushee 1997, 1998) on the value-relevance of 
ETR-related earnings changes has been inconclusive and inconsistent. Guenther and Jones 
(2003) reconcile these studies and determine that the conflicting results are due primarily 
to how the tax change component is measured in each study (see footnote 1). 

Guenther and Jones (2003) document a significant, positive association between con- 
temporaneous abnormal returns and the tax change component and show that this associ- 
ation varies cross-sectionally and over time. Guenther and Jones (2003) hypothesize that 
the variation in the magnitude of the association between contemporaneous returns and the 
tax change component is due to the persistence of the tax change component in different 
contexts." However, this expectation assumes that the market is efficient with respect to 
the implications of the tax change component for future earnings.!? 

Despite the recently documented value-relevance of the tax change component, there 
is conflicting evidence about the efficient use of this information by market participants. 
For example, Abarbanell and Bushee (1997) find that analysts overreact to annual ETR- 
related earnings changes before the implementation of SFAS No. 109, while Chen and 
Schoderbek (2000) find that analysts failed to incorporate the deferred tax adjustment re- 
quired by SFAS No. 109 into their earnings forecasts when corporate tax rates increased 
in 1993. Bauman and Shaw (2005) find that analysts' quarterly earnings forecasts incor- 
porate (underreact to) the earnings effects of ETR increases and small ETR decreases (the 
earnings effects of large ETR decreases) in previous quarters after the implementation of 
SFAS No. 109. 

Evidence of analyst forecast inefficiency raises the question of whether stock prices 
fully reflect the persistence of the tax change component of earnings. Further complicating 
matters; research indicates that investors inefficiently utilize information contained in ana- 
lysts' earnings forecasts. Mendenhall (1991), Walther (1997), and Elgers et al. (2001, 2003) 
document that investors underweight information contained in analysts’ forecasts, while 
LaPorta (1996) and Bradshaw et al. (2001) find that investors overweight information con- 
tained in analysts’ forecasts. Therefore, it is not entirely clear that investors’ expectations 
of the forecasting implications of the tax change component will be consistent with the 
implications of the tax change component for future earnings. This leads to the final 
hypotheses: 


Ha: The earnings expectations embedded in stock prices fail to fully reflect the per- 
sistence of the tax change component of earnings. 


17 The positive coefficient on the tax change component is significantly smaller after firms’ adoption of SFAS No. 
109 in 1993. Further, in the post-SFAS No. 109 period the coefficient on the tax change component is signifi- 
cantly smaller for firms reporting the utilization of a net operating loss and for firms recording restructuring 
charges or asset impairments, and is significantly larger for firms engaged in acquisitions and for firms that 
experience a change in the percentage of foreign earnings. 

15 The results of Lev and Nissim (2004) and Hanlon (2005) weaken this assumption. Lev and Nissim (2004) 
document that the market does not fully refiect the information contained in a tax fundamental that captures 
aggregate book-tax differences. Hanlon (2005) finds that investors underestimate the persistence of the cash flow 
component of earnings for firm-years with large positive temporary book-tax differences (1.е., book income 
greater than taxable income) and overestimate the persistence of accruals when firm-years contain large negative 
temporary book-tax differences (i.e., book income less than taxable income). 
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H: The earnings expectations embedded in stock prices fail to fully reflect the per- 
sistence of the initial (Q1) and revised (Q2, Q3, Q4) tax change component of 
earnings. 


IV. RESEARCH DESIGN 
Model Development 
To focus on the effect of an ETR change on earnings, I follow Lev and Thiagarajan 
(1993) and decompose the annual change in earnings, AE, = E, — E,_,, into two 
components: 


(1) The change in pretax earnings (АРТЕ), using last year’s effective tax rate (ETR, .,): 
APTE(1 — ETR,_,), and 
(2) The effect of the annual ЕТЕ change on current pretax earnings: PTE,(ETR,.., 
): 
t 


AE, = APTE(1 — ETR,_,) + PTE(ETR, , — ETR,). (1) 


In Equation (1), PTE(ETR, , — ETR,), which represents earnings generated by a change 
in the annual ЕТК, is my measure of the tax change component of earnings.? ЕТК is a 
firm's effective tax rate, measured as total income tax expense divided by PTE. Ynterim 
(annual) Compustat data items used for each variable are: Earnings — 8 (18), Income Tax 
Expense = 6 (16), and Pretax Earnings = 23 (170). Following Gupta and Newberry (1997), 
the ETR is set to 1 when the ETR is greater than 100 percent, and to 0 when the ETR is 
negative. 


The Persistence and Forecasting Implications of the Tax Change Component 
of Earnings 

I examine the persistence of the tax change component cf earnings for future earnings 
using the following санан (firm subscripts are omitted): 


Ен = Yo + » ATE, + y,TCC, + v4 (2) 


where E represents income before extraordinary items, ATE represents aggregate earnings 
excluding the tax change component [ATE = РТЕП — ETR,_,)], and TCC represents the 
tax change component of earnings [ТСС = РТЕСЕТК, | — ЕТК,)].2° I scale all earnings 
component variables by contemporaneous average total assets (Compustat annual data item 


19 The following example, adapted from Lev and Thiagarajan (1993), should provide intuition for this measure. 
Assume: 


ct. 1-1 


Pre-tax income 120 100 
ETR 0.25 0.40 
Net income 90 60 


Following Equation (1), the net income change, 30, consists of the pretax earnings change net of the prior tax 
rate, 20 X (1 — 0.40) = 12, and the effect of the tax rate change, 120 (0.40 — 0.25) = 18. I refer to the latter 
component of the earnings change as the tax change component. 

My partitioning of earnings into a tax component (TCC) and a pretax component (ATE) generally assumes that 
the pretax component is unrelated to the tax component. I acknowledge that some tax-planning choices directly 
affect pretax earnings through implicit taxes. Although it is possible that some of the pretax persistence is related 
to implicit taxes, I assume that the magnitude of any implicit tax effects is too small to affect my inferences. 
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#6) to control for differences in size across firms and over time and to allow for cross- 
sectional comparability. 


The Persistence and Forecasting Implications of the Tax Change Component of 
Earnings in Interim Periods 

The interim reporting requirements of APB Opinion No. 28 and FASB Interpretation 
No. 18 allow me to further decompose the tax change component of earnings into an initial 
and a revised component based management’s first quarter estimate of the ETR that is 
expected to be applicable for the full fiscal year. Hypothesis 1 predicts that Equation (2) is 
misspecified: since it implicitly constrains the coefficients on the initial and revised tax 
change components of earnings to be equal. The specification implied by H, is: 


Ела = б + SATE, + &JINTCC, + 5,КЕУТСС, + v, (3) 


where INTCC represents the initial tax change component of earnings [PTE,(ETR,_, 
— ЕТКОЇ,)) and REVTCC represents the revised tax change component of earnings 
[PTE(ETRQI, — ЕТЕД]. ETRQ1, is the first quarter estimate of the annual ЕТК. Hypothesis 
1 predicts 8, > 8,. The larger coefficient predicted on the initial relative to the revised tax 
change component of earnings reflects the higher persistence expected from the initial 
tax change component of earnings. To mitigate concern that any differential persistence is 
driven by asset growth, I scale the dependent variables in Equations (2) and (3) by lagged 
average total assets so the scalar is the same as that used for the independent variables 
(Fairfield et al. 2003). 

Finding a significant positive relation for any coefficient in Equations (2) or (3) indi- 
cates that the component is incrementally informative in predicting future earnings. For 
each earnings component, a slope coefficient of 0 indicates that the earnings component is 
purely transitory, while a coefficient of 1 indicates that the earnings component follows a 
random walk. A slope coefficient greater than 1 indicates growth in the component. 


Valuation Implications of the Tax Change Component of Earnings 

My final objective is to test whether the market fully incorporates the forecasting im- 
plications of the initial, revised, and annual tax change component of earnings. To test H,, 
and H5, I estimate the following systems of equations in accordance with the Mishkin 
(1983) framework: 


Е = Yo + у АТЕ, + у ТСС, + Vii (2) 
АК, = Фо + (Е, — Yo — УТАТЕ, — УХТСС,) + є, (4) 
Ej, = $ + ATE, + 6,ДҮТСС, + &4REVICC, + vai (3) 


АК, = ey + о (В — бо — 8fATE, — 83ЇМТСС, - S$REVICC) + ва (5) 


where AR,,, = the abnormal return in year 1+1, measured as the 12-month buy-and-hold 
return beginning four months after the fiscal year-end minus the buy-and-hold value- 
weighted return of a benchmark portfolio over the same 12-month period (Daniel et al. 
1997). I create 125 (5 X 5 X 5) size/book-to-market/prior year return portfolios each year 
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to control for a set of common risk factors: size, book-to-market, and momentum.?! I 
measure each characteristic at the end of the most recent fiscal year prior to portfolio 
formation. 

Equation (2) is a forecasting equation that estimates the forecasting coefficients (у) of 
aggregate earnings excluding the tax change and the tax change component for predicting 
future earnings.” Each coefficient is a measure of the persistence of an earnings component 
for future earnings. Equation (4) is a valuation equation that estimates valuation coefficients 
(y*) that the market assigns to earnings components. Mishkin (1983) suggests that Equation 
(4) provides an estimate of the market's perceived persistence of earnings and the tax 
change component. Because earnings and the tax change component of earnings are public 
information, market efficiency requires у; = y¥, and у, = үх. If any pair of coefficients is 
not equal, then the market's perception of earnings persistence differs from the actual 
earnings persistence implied by the specification of the forecasting equation. 

I estimate the system of Equations (2) and (4) jointly in two stages using iterative 
generalized nonlinear least squares (Mishkin 1983). In the first stage, I jointly estimate an 
unconstrained system of Equations (2) and (4). In the second stage, I impose the rational 
pricing constraints ү, = Yf (а = 1 or 2) and re-estimate Equations (2) and (4) jointly. I 
test market efficiency with the likelihood ratio statistic described in Mishkin (1983). The 
specific likelihood ratio test is 2nlog(SSR°/SSR“) ~ ха) where n = the number of obser- 
vations, 4 = the number of rational pricing constraints, SSR° = the sum of squared residuals 
from the constrained weighted system, and SSR“ = the sum of squared residuals from the 
unconstrained weighted system. 


Sample Selection Procedures and Sample Profile 


My sample consists of all U.S. Compustat firm-years between 1994 and 2001 with non- 
missing values of annual and quarterly assets, total income tax expense, and pretax income 
(36,442 observations). Due to the difficulty in interpreting ETRs when a firm has negative 
pretax income, I exclude 13,581 firm-years with negative pretax income. Then, I delete 
1,591 mutual funds, trusts, REITs, limited partnerships, and other flow-through entities due 
to different reporting requirements and varying earnings management and tax-planning in- 
centives. Next, I exclude 7,753 firm-years with missing CRSP return data. Finally, to re- 
move the effects of outliers from the data, I drop 822 firm-years with the highest and lowest 
] percent of the values for each variable in year t (Kothari and Zimmerman 1995; Collins 
et al. 1997; Fama and French 1998; Barth et al. 1999).? My final sample consists of 12,695 
firm-years, which are distributed uniformly throughout the sample period. 


V. EMPIRICAL RESULTS 
Descriptive Statistics 
Table 1 reports descriptive statistics for the variables used in this study. Panel A de- 
scribes the variables that I use in the calculation of the tax change component of earnings. 
The average annual ETR in the prior and current year is 35.68 percent and 35.47 percent, 
respectively, while the average ETR in Q1, which represents management's expectation of 
the annual ЕТК, is 35.93 percent. ЕТКОЇ, is significantly higher than ЕТК, , and ЕТК, 


21 Firms must have calendar year-end market capitalization values available on CRSP and positive fiscal year-end 
book value data (annual data item £60) available on Compustat. 

2 The discussion of the Mishkin (1983) procedure for Equations (2) and (4) generalizes to Equations (3) and (5). 

23 Extreme returns or earnings in year t+1 are not eliminated since this would introduce hindsight bias in the 
results (Thomas 2000). I also run all of my tests on data that has been winsorized rather than trimmed. The 
inferences from all of my tests are unchanged when I use the winsorized data. 
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TABLE 1 
Descriptive Statistics 
Standard 1st 3rd 
Variable _Mean _ Deviation Quartile Median Quartile 
Panel A: Financial Varlables* 
Assets, 6306.79 34104.66 163.003 553.621 2161Л40 
РТЕ, 275.72. 1002.81 11.679 40.740 147.345 
ETR, 0.3563 0.0840 0.3304 0.3676 0.3939 
ETR; 0.3547 0.0788 0.3318 0.3690 0.3945 
ЕТКОЇ, 0.3593 0.0756 0.3389 0.3716 0.3960 
Panel B: Regression Variables” 
| 0.0659 0.0544 0.0243 0.0534 0.0932 
АЁ, 0.0409 0.6180 —0.2951 —0.0393 0.2639 
ТСС, —0.0004 0.0098 —0.0014 0.0000 0.0015 
ATE, 0.0700 0.0532 0.0247 0.0534 0.0938 
TCC, —0.0002 0.0044 —0.0011 0.0000 0.0010 
INTCC, —0.0005 0.0042 —0.0012 —0.0001 0.0002 
REVICC, 0.0003 0.0029 —0.0003 0.0001 0.0015 
Panel C: Risk and Anomaly Control Variables‘ 
MVE, 3373.70 15896.93 104.35 397.86 1506.92 
BMR, 0.2701 1.8811 —0.1415 0.1567 0.6970 
PBHR, 0.1807 0.5498 —0.1228 0.1071 0.3701 
BETA, 0.8789 0.5997 0.4732 0.8089 1.1958 
SPEC, —0.0024 0.0166 0.0000 0.0000 0.0000 
DTAX, —0.0018 0.0115 —0.0065 —0.0009 0.0033 
VOL, 95.1764 113.4364 33.7918 61.2824 108.9021 
EP, 0.0698 0.0398 0.0457 0.0642 0.0858 
PTXACC, —0.0244 0.0761 —0.0642 —0.0306 0.0057 
PTXCF, 0.1462 0.1025 0.0840 0.1377 0.2024 
Panel D: Industry Composition? 
| Меап Mean Mean 
Industry Membership ТСС INTCC REVTCC 
Mining and Construction —3.1851 —6.5646 3.3795 
Food 1.1964 —2.0408 3.2372 
Textiles, Printing, Publishing —0.0920 —3.4453 3.3534 
Chemicals 0.0651 —7.4574 7.5225 
Pharmaceuticals —0.4225 —8.2941 7.8716 
Extractive Industries —15.0310 —14.3429 —0.6881 
Durable Manufacturers —2.0190 —8.1600 6.1410 
Computers —4.6170 й ~7.4794 2.8624 
Transportation —2.3980 —1.1700 —1.2280 
Utilities —3.4671 —5.9137 2.4466 


(continued on next page) 
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TABLE 1 (Continued) 


Mean Mean Mean 

Industry Membership — Tce INTCC REVTCC 
Retail —1.3513 —4.1479 2.1967 
Financial Institutions — 1.6000 —1.9117 0.3117 
Insurance and Real Estate —4.6166 —7.9737 3.3571 
Personal Services —3.8471 —6.2612 24141 
Professional Services —3.7826 —5.6880 1.9054 
Other —2.1882 —7.6665 5.4783 

Total —2.9597 —6.1573 3.1976 


* Panel A variables are defined as follows: Assets = total assets (Compustat annual item #6); PTE = Pretax 
income (Compustat annual item #170); ETR = Effective tax rate (Compustat annual item #16 + PTE); and 
ЕТКОЇ = 01 Effective tax rate (Compustat quarterly item #16 + Compustat quarterly item #23). 

> Panel B variables are defined as follows: E = income before extraordinary items (Compustat annual item #18). 
AR = the 12-month buy-and-hold security return beginning four months after the fiscal year end minus the 
value-weighted return of the benchmark size/book-to-market/prior year return portfolio over the same 12- 
month period. The benchmark portfolios are created by first sorting stocks into quintiles based on their 
capitalization values (MVE) at the beginning of the year of portfolio formation. NYSE breakpoints are used so 
that there are an equal number of NYSE stocks in each quintile. Next, within each of the size quintiles, stocks 
are sorted into quintiles based on their industry-adjusted book-to-market ratios. The book-to-market (BMR) 
ratio is defined as the book value at the end of the fiscal year prior to the year of portfolio formation divided 
by capitalization value at the beginning of the calendar year of portfolio formation. Book-to-market ratios are 
industry adjusted by subtracting the mean industry book-to-market ratio over the sample period from the 
individual stock’s book-to-market ratio, where industries are defined along two-digit SIC codes. Finally, within 
the 25 size/book-to-market portfolios, stocks are sorted based on their pricr 12-month return (PBHR) ending 
one month prior to portfolio formation. TCC = the tax change component of earnings, measured as 
PTE(ETR,., — ETR). ATE = aggregate earnings excluding the tax change component of earnings, measured 
as PTE(1 — ETR,..;). INTCC = the initial tax change component of earnings, measured as PTE(ETR,.., 
~ ETRQI). REVICC = the revised tax change component of earnings, measured as РТЕ(ЕТКОЇ, — ETR). - 
All regression variables (except AR), are scaled by contemporaneous average total assets. 

* Panel C variables are defined as follows: MVE = market value of equity (Compustat annual item #25 * #199); 
BMR = book-to-market ratio (Compustat annual item #60 + MVE); PBHR = 12-month buy-and-hold return, 
defined above; BETA — systematic risk estimated from regressions of monthly raw returns on a value-weighted 
portfolio over a 60-month period prior to the abnormal return accumulation period; SPEC, — special items 
(Compustat annual item #17); DTAX = deferred income tax expense (Compustat annual item #50); VOL 
= annual trading volume divided by common shares outstanding; EP = the earnings-to-price ratio, measured 
as income before extraordinary items divided by the fiscal year-end stock price; PTXACC = pretax accruals, 
defined as pretax income minus pretax operating cash flows; and PTXCF = pretax operating cash flow, 
measured as operating cash flow plus cash taxes paid minus cash flow due to extraordinary items (Compustat 
annual item #308 + #317 — $124). SPEC, DTAX, PTXACC, and PTXCF are scaled by contemporaneous 
average total assets. 

4 All means are multiplied by 10,000. 


(t = 4.7462 and 2.4925, respectively). I obtain similar inferences using the nonparametric 
Wilcoxon signed-rank test (ЕТКОЇ, is significantly higher than ЕТК, , and ЕТК, [z 
= 19.102 and 14.530, respectively]). This seems to indicate that on average, management 
sets the estimate of the annual ETR too high in Q1, perhaps to create a reserve from which 
current earnings can be drawn, or because tax departments are conservative in their initial 
estimates (Comprix et al. 2006). 

Panel B of Table 1 reports the descriptive statistics for the variables used in the re- 
gression analyses. The mean of E,,, is 0.0659, which indicates the average scaled earnings 





24 The unique feature of the integral view of quarterly reporting for taxes is that the estimation of the initial ЕТЕ 
creates opportunities for intra-year earnings management (annual ETR estimates are used to calculate quarterly 
income tax expense), which would affect current annual earnings, but not necessarily future annual earnings. 
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for the sample firms is approximately 6.6 percent. The average TCC, accounts for approx- 
imately 2.1 percent of the average change in return on assets (ROA). INTCC, decreases 
ROA by approximately 5 percent on average, while REVTCC, increases ROA by approxi- 
mately 3 percent on average. Therefore, for the average firm in the sample, approximately 
60 percent of INTCC, reverses by year-end. These statistics are consistent with those from 
Panel A and quantifies the effect of tax changes on earnings. Panel C presents descriptive 
statistics for the risk and anomaly control variables. 

Panel D of Table 1 presents industry descriptive statistics, where the industry break- 
down is adapted from Barth et al. (1999) and Rajgopal et al. (2003). The majority of the 
sample is clustered in four industries: financial institutions (20.17 percent), durable manu- 
facturers (19.08 percent), retail (12.24 percent), and computers (7.63 percent). I also ex- 
amine the means of ТСС, INTCC, and КЕУТСС, at the industry level in Panel D of Table 
1. The degree of reversal in INTCC, by the end of the year is startling for some industries. 
For example, in the food industry, 159 percent of INTCC, reverses by year-end, while 101 
percent of INTCC, reverses by year-end for firms in the chemicals industry. Other industries 
exhibiting this strong reversal in INTCC, include textiles, printing and publishing (97 per- 
cent), retail (67 percent), pharmaceuticals (95 percent), and durable manufacturers (75 
percent). 

Table 2 reports Pearson correlations. Consistent with Guenther and Jones (2003), TCC, 
is positively correlated with contemporaneous returns (pp = 0.0218, p = 0.0140). INTCC, 
is also positively correlated with contemporaneous returns, while the correlation between 
КЕУТСС, and returns is not significant. INTCC, (REVTCC,) is negatively (positively) cor- 
related with ATE, (pp = —0.1029, p = 0.0000 and p, = 0.0270, p = 0.0024). Finally, the 
correlation between INTCC, and КЕУТСС, is significantly negative (pp = —0.2778, р 
= 0.0000), as is the correlation between pretax accruals and pretax operating cash flow (pp 
= —0.6455, р = 0.0000). 


Multivariable Results 

Tables 3 and 4 present the results from estimating Equations (2) and (3) using OLS.” 
Because the data are pooled, I estimate the regressions in the spirit of Fama and MacBeth 
(1973). I estimate yearly cross-sectional regressions and use the average slopes and their 
time-series standard errors to draw іпѓетепсез.26 In Tables 3 and 4, I list the mean coefficient 
estimate first, followed by the Fama-MacBeth t-statistic and the number of yearly positive 
(negative) coefficients. 

I also partition the sample and estimate Equations (2) and (3) separately for firms with 
decreases (increases) in annual ETRs. Because decreases in annual ETRs increase earnings 
(or conversely, increases in annual ETRs decrease earnings), ceteris paribus, managers will 


25 Results from the Breusch and Pagan (1980) Lagrange Multiplier Test (y? = 0.59, р = 0.4431) indicate that a 
fixed or random effects specification is not appropriate for Equation (2). This result holds for all specifications 
of Equations (2) and (3). 

26 Assuming independence through time, the Fama-MacBeth regressions control for cross-sectional correlation in 
the residuals. In the presence of positive cross-correlation in the residuals, the standard errors can be biased 
downward and the t-statistics biased upward in pooled regressions. For example, Fama and French (2000) find 
that Fama-MacBeth standard errors are two to five times the OLS standard errors from pooled regressions using 
their sample. | 

However, the assumption of temporal independence in regressions without abnormal returns as the depen- 
dent variable is tenuous, as many firms appear in the regressions multiple years. Any autocorrelation in the 
coefficients would further bias the standard errors, but reliably estimating the serial correlation from eight 
observations is problematic. Fama and French (2000) suggest that if the first-order autocorrelation is 0.5, t- 
statistics of 2.8 (instead of 1.96) should be used to infer reliability. Accordingly, I require t-statistics of 2.8 to 
indicate significance of the coefficient estimates in Equations (2), (3), (6), and (7). 
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want to sustain ETR decreases and mitigate ETR increases. Therefore, the results also 
provide evidence on the persistence and forecasting implications of the tax change com- 
ponent of earnings under different conditions. 


і 


The Persistence and Forecasting Implications of the Tax Change Component 
of Earnings 

Table 3 presents the results from estimating Equation (2), in silua future earnings are 
regressed on current earnings and the tax change component. The estimate of the persistence 
of ATE, in Panel A is 0.8106 (t — 34.53), while the persistence estimate of TCC, is 0.7121 
(t = 6.56). The coefficient on TCC, is clearly different from zero, which is initial evidence 
that the tax change component of earnings is not transitory. When I estimate Equation (2) 
separately for firms with either annual ETR decreases (Panel B) or increases (Panel C), the 
results continue to indicate that the tax change component is not transitory. Specifically, 














TABLE 3 
Summary Regression Statistics of the Forecasting Implications of the m Change Component 
of Earnings? 
Emi = Yo + YATE, + Үү›ТСС, T 1775) { (2) 

Panel A: Full Sample (n = 8) | 
Parameters” ae. Эн n R? 
Estimate 0.0091 0.8106 0.7121 0.6041 
t-statistic 9.49* 34.53* 6.56*° 37.80* 
Pos/(Neg) 8/(0) 8/(0) 8/(0) 
Panel B: ETR Decreases (n = 8) | 
Parameters Ye a | ae Ya, m 

[ 
Estimate 0.0073 0.8286 0.7373 0.6777 
t-statistic 6.45* 35.02* 3.719 41.17* 
Pos/(Neg) 8/(0) 8/(0) та) 
Panel С: ETR Increases (n = 8) | 
Parameters. аа ГА h ка 

| 
Estimate 0.0108 0.7935 0.7198 0.5696 
t-statistic 13.73* 39.36* 3.08*° 27.33* 
Pos/(Neg) 8/(0) 8/(0) 6/(2) 


The symbols *, t, and + denote conventional significance at the 0.01, 0.05, and 0.10 (two-tailed) levels, 

respectively. In order to account for potential serial correlation, I interpret t-statistics of 2.8 or higher on the 

regression coefficients as an indication of significance at the 0.05 level (Fama and French 2000). 
* This table presents summary statistics of annual cross-sectional regressions estimated! using the Fama and 
MacBeth (1973) approach. I use average slopes and their time-series standard errors to draw inferences (mean 
coefficients are reported first, followed by the t-statistic and the number of yearly positive (negative (negative) 
coefficients). OLS is used rather than a fixed or random effects model. Results from Breusch and Pagan (1980) 
LM tests indicate that OLS is the appropriate estimation method. | 

* Variables are defined as follows (all variables are scaled by average total assets): E T income before 
extraordinary items; ATE = aggregate earnings excluding the tax change, measured as PTE(1 — ETR,_,); and 
TCC = the tax change component of earnings, measured as e AA ETR). 

* КО;: Is the tax change component of earnings useful in forecasting future earnings incremental to earnings 
excluding the tax change? | 


1 
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when annual ETRs decrease (increase), the coefficient on TCC, is 0.7373 (0.7198) with a 
t-statistic of 3.71 (3.08). These results contradict the maintained hypothesis in prior liter- 
ature that earnings generated by ETR changes are transitory. 

If the earnings innovation to TCC, were completely persistent, then y, in Table 3 would 
be 1. However, the magnitude of the coefficient on TCC, in each specification of Equation 
(2) indicates that TCC, contains transitory elements. The results in the next section should 
help to identify the origins of the persistence of the tax change component of earnings. 


The Persistence and Forecasting Implications of the Tax Change Component of 
Earnings in Interim Periods 

Table 4 reports the estimation results for Equation (3), in which future earnings are 
regressed on current earnings, the initial tax change component, and the revised tax change 
component." The coefficient on ATE, in Panel A is 0.8137 (t = 34.02). The coefficient on 
INTCC, is 0.9398 (t = 7.50) and the coefficient on REVTCC, is 0.1116 (t = 0.72). The test 
of H, rejects the hypothesis that 6, = 8, (t = 4.89). Therefore, for the full sample, the 
initial tax change component of earnings is more persistent for future earnings than the 
revised tax change component of earnings. 

Panel B of Table 4 reports the results of estimating Equation (3) for firms with annual 
ETR decreases. The coefficient on INTCC, is 0.9627 (t = 3.65) and the coefficient on 
REVTCC, is 0.2909 (t = 2.20). The test of Н, rejects the hypothesis that $, = $, (t = 2.53). 
Results from estimating Equation (3) for firms with annual ETR increases (Panel C) show 
that INTCC, is also more persistent for future earnings than REVTCC, (8, = 0.9144, 
t = 4.29; 8, = —0.0505, t = —0.15). The test of H, rejects the hypothesis that 8, = 8, 
(t — 6.09). 

Prior literature has generally found that the coefficient on pretax earnings is greater 
than that on ETR-related earnings. Although this is not the focus of my study, unreported 
tests indicate that the difference in the persistence of the tax (TCC and INTCC) and earnings 
coefficients (ATE) is not statistically significant. The failure to reject the null hypothesis of 
no difference could be seen as further evidence against the hypothesis that ETR-related 
earnings are transitory. 

When I decompose REVTCC, to examine the forecasting implications of the tax change 
component for each individual quarter (Q2, Q3, and Q4), the results indicate that the tax 
change component becomes more transitory (less persistent) as the year progresses (1.6., 
Q1 > Q2 > Q3 > Q4). The coefficient estimates for TCC, in each quarter are: Q1 
= 0.9546; 02 = 0.4836; Q3 = 0.0937; Q4 = —0.1838. Both Q1 and 02 are significant at 
p « 0.05. 

I also find that 6, is greater than zero and 8, is not statistically different from zero in 
all reported specifications of Equation (3). These results suggest that, on average, manage- 
ment's initia] ETR estimate reflects the persistent information contained in the annual tax 
change. By decomposing TCC, into an initial and revised portion, I identify a persistent 


27 Results of a series of F-tests indicate that the decomposition of earnings into these three components is more 
informative than a univariate regression of future earnings on current earnings (F = 81.61, p = 0.0000) or a 
regression of future earnings on current earnings and the aggregate tax change component (F = 56.08, p 
= 0.0000). 

28 The phrase “оп average” should be emphasized. If I partition the sample into firms that have increases in INTCC 
and decreases in REVTCC, then the persistence estimates of TCC (in Equation (2)) and INTCC (in Equation 
(3)) are not significantly different from zero. However, INTCC is still significantly larger that REVTCC in this 
situation. 
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TABLE 4 
Summary Regression Statistics of the Forecasting Implications of the Initial and Revised Tax 
Change Component of Earnings* 
Ej, = бу + 8,ATE, + &INTCC, + 5,КЕУТСС, tv, (3) 

Panel A: Full Sample (п = 8) 
Parameters” бо 8, 5, 8, R? H,: 5, > бу 
Estimate 0.0092 0.8137, 0.9398 0.1116 0.6562 0.8281 
t-statistic 9.33* 34.02* 7.50* 0.72 37.12* 4.89* 
Pos/(Neg) 8/(0) 8/(0) 8/(0) 6/(2) 7(1) 
Panel B: ETR Decreases (n — 8) 
Parameters 8, 5, 5, 5, m H,: $, > $, 
Estimate 0.0076 0.8290 0.9627 0.2909 0.6997 0.6718 
t-statistic 6.42* 34.15* 3.65* 2.20 41.50* 2.531 
Pos/(Neg) 8/(0) 8/(0) 7101) 5/(3) 7(1) 
Panel C: ETR Increases (n = 8) 
Parameters . 8, 8, 8, 5, m H: 5, > ё, 
Estimate 0.0106 0.7958 0.9144 —0.0505 0.6225 0.9649 
t-statistic 13.20* 38.60* 4.29* —0.15 26.74* 6.09* 
Pos/(Neg) 8/(0) 8/(0) 8/(0) 5/(3) 8/(0) 


The symbols *, t, and ў denote conventional significance at the 0.01, 0.05, and 0.10 (two-tailed) levels, 
respectively. In order to account for potential serial correlation, I interpret t-statistics of 2.8 or higher on the 
regression coefficients as an indication of significance at the 0.05 level (Fama and French 2000). The symbols 
for tests of H, represent significance levels for one-tailed tests. 

* This table presents summary statistics of annual cross-sectional regressions estimated using the Fama and . 
MacBeth (1973) approach. I use average slopes and their time-series standard errors to draw inferences (mean 
coefficients are reported first followed by the t-statistic and the number of yearly positive (negative) 
coefficients). OLS is used rather than a fixed or random effects model. Results from Breusch and Pagan (1980) 
LM tests indicate that OLS is the appropriate estimation method. 

> Variables are defined as follows (all variables are scaled by average total assets): E = income before 
extraordinary items; ATE = aggregate earnings excluding the tax change, measured as PTE(1 — ETR,..,); 
INTCC = the initial tax change component of earnings, measured as PTE(ETR, , — ETRQI,); and КЕУТСС = 
the revised tax change component of earnings, measured as PTE(ETRQI, — ETR,). 

*H,: The initial (Q1) tax change component of earnings is more persistent for future earnings than subsequent 
"revisions (Q2, Q3, Q4) in the tax change component of earnings. 


element of the tax change component of earnings, which further dispels the notion that 
earnings generated by ETR changes are transitory. 


Valuation Implications of the Tax Change Component of Earnings 

Panel A of Table 5 reports results from estimating the system of Equations (2) and (4) 
on the full sample. For ATE, the valuation coefficient (уж = 0.9131) is larger than the 
forecasting coefficient (y, = 0.8149), which suggests that the market overestimates the 
persistence of earnings excluding the tax change relative to its ability to forecast one-year- 
ahead earnings. The likelihood ratio statistic reported in Panel A is statistically significant 
ОСИ) = 6.3403, p = 0.009). The market appears to underestimate the persistence of TCC; 
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TABLE 5 
Nonlinear Generalized Least Squares Estimation of the Pricing of Earnings Components" 


Panel A: Market Pricing of the Tax Change Component with Respect to its Implications for 


One-Year-Ahead Earnings 
Eni = Yo + WATE, + ТСС, + уд (2) 
АК, = Фо + ф(Е — Yo — УТАТЕ, - YITCC,) + вал (4) 
Parameters” л: Үз у чі 
Estimate 0.8149 0.6914 0.9131 —0.2276 
Asymptotic SE 0.0055 0.0669 0.0391 0.4707 
Test of Market Efficiency:° ‘N= ү = ү Ye = VE 
Likelihood Ratio Statistic: 6.3403* 3.7913* 10.9818* 


Panel B: Market Pricing of the Initial and Revised Tax Change Component with Respect to 
their Implications for One-Year-Ahead Earnings 


Ела = бу + ЗАТЕ, + &JNTCC, + 8,REVTCC, + v, (3) 
АК, = Фо + о(Е — бо — SPATE, — &HNTCC, — 8 КЕУТСС, + &, (5) 

Parameters 5, 5, 8, of ef 51 

Estimate 0.8176 0.9147 0.0977 0.9179 0.2571 —1.4850 

Asymptotic SE 0.0055 0.0731 0.1036 0.0388 0.5080 0.7243 

Test of Market Efficiency:? 5, = + 8, = 8» 8; = 8f ‚ = oF 

Likelihood Ratio Statistic: 6.7291* 1.6526 4.8654* 12.4737* 


The symbols *, t, and $ denote significance at the 0.01, 0.05, and 0.10 (two-tailed) levels, respectively. 
* This table reports the estimation results of the Mishkin (1983) framework. The two systems of equations are 
jointly estimated using an iterative nonlinear generalized least squares estimation procedure based on 12,695 
Observations during 1994—2001. 

> Variables are defined as follows (all variables are scaled by average total assets): E = income before 
extraordinary items; ATE = aggregate earnings excluding the tax change, measured as PTE(1 — ETR,_,); TCC 
== the tax change component of earnings, measured as PTE(ETR,., — ETR); INTCC = the initial tax change 
component of carnings, measured as PTE(ETR, ., — Ph and REVTCC - the revised tax change 
component of earnings, measured as РТЕДЕТКОЇ, = 

* На: The earnings expectations embedded in stock prices dd to fully reflect the persistence of the tax change 
component of earnings. 

3 H4: The earnings expectations embedded in stock prices fail to fully reflect the persistence of the initial (Q1) 
and revised (Q2, Q3, Q4) tax change component of earnings. 


the valuation coefficient (yf = —0.2276) is significantly smaller than the forecasting co- 
efficient (у, = 0.6914; х2(1) = 3.7913, p = 0.048). 

Panel B of Table 5 reports results from the estimation of the system of Equations (3) 
and (5) on the full sample. The results for ATE, are similar to those reported in Panel A: 
the valuation coefficient (8* = 0.9179) is significantly larger than the forecasting coefficient 
(8, = 0.8176), which suggests that the market overestimates the persistence of earnings 
relative to its ability to forecast one-year-ahead earnings. The market appears to underes- 
timate both the persistence of INTCC, and the persistence of REVTCC,. Specifically, the 
valuation coefficient (5* = 0.2571) is smaller than the forecasting coefficient (5, = 0.9147) 
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for ПҮТСС, and the valuation coefficient (6 = — 1.4850) is less than the forecasting co- 
efficient (6, = 0.0977) for REVICC, The likelihood ratio statistics reported in Panel B 
indicate that the mispricing of INTCC, is not statistically significant (x?(1) = 1.652, р 
= 0.199), while the mispricing of REVTCC, is statistically significant (х2(1) = 4.865 
р = 0.027).? These results suggest that investors do not fully understand the implications 
of the transitory nature of the revised tax change component. 

The lower valuation coefficients (relative to the forecasting coefficients) on TCC and 
REVTCC are consistent with a market that punishes firms for engaging in behavior it deems 
representative of earnings management. DeFond and Park (2001), Bartov et al. (2002), and 
Gleason and Mills (2004) find that the market discounts the reward to beating earnings 
expectations when earnings appear to be managed. 


Robustness Checks and Sensitivity Analyses 
Tax (or Benefit) Applicable to Significant Unusual or Infrequently Occurring Items, 
Discontinued Operations, or Extraordinary Items 

A provision of APB No. 28 excludes taxes related to "significant unusual or extraor- 
dinary items that will be separately reported or reported net of their related tax effect” from 
the estimated annual ETR calculation (APB No. 28, 919). Instead, these items should be 
recognized in the interim period in which they occur (APB No. 28, 121). To the extent that 
these significant unusual or extraordinary items are not randomly distributed throughout the 
year, they could drive the results for H,. A similar concern exists with respect to in-process 
research and development (IPRD), a large ETR reconciliation item that is transitory. Be- 
cause IPRD is not persistent, the transitory, nature of REVTCC could be due to IPRD 
(instead of earnings management) to the extent that IPRD is either not predicted well in 
Q1 or IPRD is booked disproportionately in Q2—Q4. To address this concern, I identify 
firms that reported a merger or acquisition, divestiture, special item, or extraordinary item 
(Compustat item numbers FN1, 66, 17, and 192, respectively), exclude them from the 
sample (leaving a sample of 10,375 firm-year observations), and rerun the tests for H,. 
Untabulated results indicate that the null continues to be rejected for each hypothesis. 


Abnormal Returns to a Hedge Strategy 

The findings from the Mishkin (1983) test suggest that stock prices do not accurately 
reflect the persistence of the tax change component of earnings and this mispricing is driven 
by the revised tax change component of earnings. In order to control for additional risk 
and anomaly variables, I supplement the Mishkin (1983) tests by estimating return regres- 
sions using the Fama and MacBeth (1973) technique. 

The Fama and MacBeth (1973) approach calls for the formation of zero-investment 
hedge portfolios. To create long and short positions of the hedge portfolios, I first rank 
firms on the magnitude of ATE, and equally assign firms to one of ten portfolios each year. 
Next, within each of the ten ATE portfolios, I assign firms equally into deciles based on 
INTCC, and REVTCC, I divide each INTCC, and REVTCC, decile number by 9 so each 
observation related to INTCC, or REVTCC, takes on a value ranging from 0 (for the lowest 
decile) to 1 (for the highest decile). I form the portfolios in this manner in order to rank 
firms on INTCC, ог REVICC, while controlling for the level of ATE, Firms in the lowest 
decile are firms with highest values of INTCC, and REVICC, for a given level of ATE, 


29 Because the data used in the Mishkin estimation are pooled, I also estimate the system of Equations using the 
Fama and MacBeth (1973) approach, the benefits of which are discussed in footnote 26. The coefficient estimates 
from each equation are qualitatively similar, and a two-tailed difference of means t-test of the hypothesis that 
the mean valuation and forecasting coefficients of REVTCC are equal is rejected (t = 2.38, p = 0.046). 
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while firms in the highest decile have the lowest values of INTCC, and REVTCC, for a 
given level of ATE,. The hedge portfolio is formed by taking a long (short) position in firms 
in the highest (lowest) decile of INTCC, or REVTCC,. 

І implement the Fama and MacBeth (1973) approach by estimating the following cross- 
sectional regression for each of the eight years in the sample: 


AR, = Po + ФИМТСС? + ф,КЕУТССЕ + Бал (6) 


where INTCC* represents the scaled portfolio decile rank of the initial tax change com- 
ponent of earnings and REVT'CC* represents the scaled portfolio decile rank of the revised 
tax change component of earnings. In order for the regression coefficients to be interpreted 
as zero-investment portfolio returns, the returns used in the regressions must be for the 
same time period and the accounting information used to create the hedge portfolios must 
coincide chronologically and be publicly available at the time of portfolio formation. Thus, 
I estimate Equation (6) using firms with December year-ends only, which reduces my 
sample size to 8,061 firm-year observations. To further insure that the information is avail- 
able to the market during the return calculation period, I also require firms to have Com- 
pustat earnings announcement dates for all four quarters in the subsequent year and these 
dates must fall within the 12-month buy-and-hold return interval (Thomas 2000). 

The results from the Fama and MacBeth (1973) regressions in Panel A of Table 6 
generally confirm the results of the Mishkin (1983) test. There is an insignificant, posi- 
tive association between INTCC, and future returns in Equation (6). There is also a positive 
association between REVTCC, and future returns in Equation (6), however the association 
is marginally significant (Ф, = 0.0487, t = 2.09). The positive sign on КЕУТСС, is con- 
sistent with the difference in historical and security market weightings documented using 
the Mishkin (1983) framework. Because the market underestimates the persistence of 
REVICC, there should be positive abnormal returns for portfolios ranked on REVTCC,. | 
Thus, the abnormal return to a trading strategy based on REVTCC, is 4.87 percent. 


Controlling for Additional Risk Factors and Market Anomalies 

Although my measure of abnormal returns controls for three of the four commonly 
identified risk factors (Fama and French 1993; Carhart 1997), I also estimate the following 
regression tó ensure that any relation between the revised tax change component of earnings 
and future abnormal returns is incremental to other risk factors and market anomalies 
identified in prior research: 


АК, = Фо + ФЇМТСС = + ф.ВЕУТССІЄЄ + ФАВЕТА о + SPEC} 
+ Ф-РТАХ + p VOL} + ФЕР? + o,PTXCF; 
+ ФРТХАСС + Ena (7) 


where BETA, = the slope coefficient from a regression of raw monthly returns on the value- 
weighted portfolio return over a 60-month period prior to the abnormal return accumulation 
period (Fama and French 1992), SPEC, = special items (annual Compustat item #17), 
computed as special items divided by average total assets (Elliott and Hanna 1996; Easton 
et al. 2000; Burgstahler et al. 2002), DTAX, = deferred income tax expense (annual Com- . 
pustat item #50) divided by average total assets (Chaney and Jeter 1994; Philips et al. 2003; 
Joos et al. 2003; Hanlon 2005), VOL, — the firm-specific annual trading volume from CRSP 
divided by common shares outstanding (Barth and Hutton 2004), EP, — the earnings to 
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price ratio (Basu 1977), PTXCF, = total pretax operating cash flow, measured as operating 
cash flow (annual Compustat item #308) minus cash flow due to extraordinary items (annual 
Compustat data item #124) plus taxes paid in cash (annual Compustat data item #317) 
divided by average total assets (Hanlon 2005), PTXACC, = total pretax operating accruals, 
measured as pretax income minus pretax operating cash flows divided by average total 
assets (Sloan 1996; Collins and Hribar 2002). All risk and market anomaly controls are 
transformed into scaled decile variable with values ranging from 0 to 1. The inclusion of 
the additional control variables reduces the sample size used to estimate Equation (7) to 
5,828 observations. The results from the Fama and MacBeth (1973) regressions in Panel 
B of Table 6 indicate that the marginally significant, incremental abnormal returns related 
to REVICC, persist after controlling for additional risk factors and market anomalies (9, 
= 0.0594, t = 2.55). 


VL CONCLUSIONS 

I examine whether earnings generated by changes in ETRs (the tax change component) 
persist and aid in forecasting future earnings incremental to aggregate earnings excluding 
the tax change. Additionally, this study extends prior literature by testing whether the market 
fully incorporates the pricing effects of the persistence of the tax change component of 
earnings. . 

I find that there is a positive, significant association between the tax change component 
of earnings and future earnings. When I decompose the tax change component into an 
initial and revised portion, I find that the initial tax change component is more persistent 
and, therefore, more useful in forecasting future earnings than the revised tax change com- 
ponent. These results are consistent with my hypotheses that the initial and revised tax 
change components have differential persistence and forecasting implications, and dispute 
the broad notion that ETR-related earnings are transitory. Results from market tests indicate 
that investors underweight the forecasting implications of the tax change component and 
the mispricing appears to be driven by the transitory nature of the revised tax change 
component. Supplemental tests confirm that the results obtained from the market tests are 
likely a result of mispricing rather than alternative explanations such as omitted risk factors. 

The results also suggest that investors can gain from breaking the tax change component 
of earnings into an initial and revised portion. To the extent that investors focus on the 
initial tax change component, they can better understand the persistence of earnings gen- 
erated by ETR changes and, consequently, make better forecasts of future earnings. Such 
an understanding should also decrease the mispricing of ETR-related earnings. Investors 
may also benefit by focusing on the relative lack of persistence for the revised tax change 
component. When viewed in the context of concurrent research that finds Q2, Q3, and Q4 
ETR revisions are used to meet or beat period-specific benchmarks (e.g., analyst forecasts), 
the lower persistence of the revised tax change component is consistent with managers 
using the revised tax change component as an earnings management tool. Increased atten- 
tion to this issue may mitigate such behavior by managers. 
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I. INTRODUCTION 

rior research compares discretionary accrual proxies for firms that just beat and just 

miss earnings benchmarks to investigate whether firms use discretionary accruals to 

beat earnings benchmarks (e.g., Dechow et al. 2003; Phillips et al. 2003). One dis- 
advantage of studies that focus solely on comparisons centered on earnings benchmarks, 
however, is that these comparisons may not adequately control for the possibility of a 
systematic association between discretionary accrual proxies and firm performance. In par- 
ticular, it is difficult to interpret a positive association between discretionary accrual proxies 
and the likelihood of beating an earnings benchmark if they also represent firm perform- 
ance.' This study investigates the competing explanations for the positive associations be- 
tween discretionary accrual proxies and firms' propensity to beat earnings benchmarks. 

In our first analysis, we use probit regressions to test whether the positive associations 
between discretionary accrual proxies and beating the profit, earnings increase, and analysts' 
forecast benchmarks hold for comparisons of groups segregated at other points in the dis- 
tributions of earnings, earnings changes, and analysts-based unexpected earnings. We refer 
to these points as "pseudo" targets. We compare firm-year observations across adjacent 
profit “bins” and examine the association between discretionary accrual measures and the 
probability that a firm reports a higher profit (i.e., resides in the next higher bin). We 
conduct similar analyses across the distributions of earnings changes and analysts-based 
unexpected earnings. We use total accruals, deferred tax expense, modified Jones (1991) 
model abnormal accruals, and forward-looking model abnormal accruals as alternative dis- 
cretionary accrual proxies. Ex ante, if the positive associations between discretionary ac- 
crual proxies and beating the actual earnings benchmarks are solely attributable to earnings 
management to beat the benchmarks, then one would expect significant positive associations 
between the discretionary accrual proxies and beating these pseudo targets by chance in no 
more than 10 percent of these comparisons where incentives to beat the pseudo targets 
should not exist.” 

In our second analysis, we test whether the association between discretionary accrual 
proxies and reporting higher profits becomes more pronounced around the actual profit 
benchmarks. We conduct similar analyses for the actual earnings increase and analysts' 
forecast benchmarks. It is possible that there is an underlying significant positive association 
between discretionary accrual proxies and firm performance (or, alternatively that firms 
manage earnings throughout the earnings distribution), but the association intensifies around 
earnings benchmarks where earnings management incentives are stronger. Evidence of an 
increased association between discretionary accrual proxies and firm performance around 
the actual benchmarks would be consistent with firms using discretionary accruals to meet 
or beat earnings benchmarks. We test whether the positive asscciation between discretionary 
accrual measures and beating each actual benchmark is significantly larger than (1) the 
average association between the discretionary accrual measures and beating the pseudo 
targets and (2) the association between discretionary accruals and beating each pseudo 
target. 

We find significant positive associations between discretionary accrual proxies and beat- 
ing the actual profit benchmark. However, results suggest that these positive associations 





- 


McNichols and Wilson (1988), Dechow et al. (1995), Kasznik (1999), McNichols (2000), and Dechow et al. 
(2003), among others, demonstrate that discretionary accruals increase with earnings. 

? Our analyses require us to choose a probability level to quantify the number of bins that should be rejected by 
chance. We choose 10 percent as the probability cut-off for all statistical tests because empirical studies generally 
consider p = .10 to be marginally significant. Conclusions are similar when we use a more narrow probability 
cut-off (e.g., 5 percent). 
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extend to a significantly larger number of pseudo targets than expected by chance. This 
evidence indicates that demonstrating positive associations between a specific discretionary 
accrual measure and beating the profit benchmark is not sufficient to conclude that the 
discretionary accrual measure detects earnings management to beat the benchmark. We 
draw similar conclusions for the earnings increase benchmark. 

Results from our second analysis indicate that the associations between discretionary 
accrual proxies and reporting higher earnings intensify around the actual profit benchmark. 
We find similar effects around the actual earnings increase benchmark. However, analogous 
patterns exist for cash flows around the profit and earnings increase benchmarks. It is 
possible that firms manage both cash flows and discretionary accruals around the profit and 
earnings increase benchmarks (Burgstahler and Dichev 1997). Alternatively, as suggested 
in Dechow et al. (2003), managers and employees may simply work harder around earnings 
benchmarks, which could explain, in part or in aggregate, the larger associations around 
the profit and earnings increase benchmarks. In sum, we are unable to eliminate other 
plausible explanations for the larger positive associations between discretionary accrual 
proxies and beating the profit and earnings increase benchmarks. 

For the actual analysts’ forecast benchmark, we find a significant positive association 
between discretionary accrual proxies and beating the benchmark. In contrast to the pseudo 
profit and earnings change analyses, we find few positive associations between discretionary 
accrual proxies and beating the pseudo unexpected earnings targets (1.е., no more than 
expected by chance). In addition, we find no association between changes in cash flows 
and beating the actual analysts’ forecast benchmark. Thus, we are able to draw stronger 
conclusions regarding earnings management around the analysts’ forecast benchmark. 

This study makes several contributions. We demonstrate that the positive associations 
between discretionary accrual measures and reporting higher earnings and more positive 
earnings changes extend to a significantly larger number of pseudo targets than expected 
by chance. In contrast, we find few positive associations between discretionary accrual 
proxies and beating the pseudo targets derived from analysts-based unexpected earnings. 
We draw two conclusions from this evidence. First, the underlying associations between 
firm performance and discretionary accrual measures are less problematic for tests of earn- 
ings management related to analysts-based unexpected earnings. Second, demonstrating 
positive associations between a specific discretionary accrual measure and beating the profit 
and earnings increase benchmarks is not sufficient to conclude that the discretionary accrual 
measure detects earnings management to beat these benchmarks. Among other implications, 
this evidence suggests in which settings the association between discretionary accruals and 
performance may be problematic. 

We develop an additional analysis that specifically accounts for the systematic associ- 
ations between discretionary accruals and earnings and earnings changes. Results from this 
analysis suggest that the associations between discretionary accruals and earnings and earn- 
ings changes intensify around the actual profit and earnings increase benchmarks where 
earnings management incentives may be stronger. However, similar patterns exist for cash 
flows around these benchmarks. Thus, we are unable to conclude that earnings management 
explains the associations between discretionary accruals and beating the profit and earnings 
increase benchmarks. 

One advantage of our analysis relative to performance-matched discretionary accruals 
(Kothari et al. 2005) is that it does not require specifying the exact nature of the correlation 
between discretionary accruals and performance—i.e., it does not require specifying a met- 
ric to match upon (e.g., ROA). Relative to performance-matching, our analysis may result 
in more powerful tests of earnings management around earnings benchmarks as it does not 
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introduce “noise” in the tests via the matching process. Accordingly, this method may 
reduce the likelihood of falsely accepting the null hypothesis of no earnings management 
due to low power tests. 

The remainder of this study is organized as follows. Section II presents our research 
method. Section Ш describes the sample selection, and Section IV presents descriptive 
statistics and results. Section V describes supplemental analyses, and Section VI concludes. 


IL RESEARCH METHOD 

One disadvantage of studies that focus solely on comparisons around earnings bench- 
marks is that they may not adequately allow for the possibility of a systematic association 
between discretionary accrual measures and firm performance irrespective of a firm's earn- 
ings management activities. Indeed, one of the central criticisms associated with using 
discretionary accrual proxies to test for earnings management is that the proxies may be 
capturing nondiscretionary accruals (i.e., differences in firm performance; see Dechow et 
al. 1995; Guay et al. 1996; Kasznik 1999; McNichols 2000). This study provides two 
analyses that may be useful in disentangling the competing explanations for the positive 
associations between discretionary accrual proxies and the propensity to meet or beat earn- 
ings benchmarks. The following paragraphs describe these analyses for the profit bench- 
mark. Subsequent paragraphs discuss analyses for the earnings increase and analysts' fore- 
cast benchmarks. 


Profit Benchmark Analyses 

We test whether the positive associations between discretionary accrual measures (і.е., 
total accruals, deferred tax expense, modified Jones model abnormal accruals, and forward- 
looking model abnormal accruals) and beating the profit benchmark hold for comparisons 
of groups segregated based on pseudo profit targets. Specifically, we compare firm-years 
across adjacent profit "bins" and examine the association between various discretionary 
accrual measures and the probability that a firm reports higher profits (i.e., resides in the 
next higher bin). Dechow et al. (1995), Kasznik (1999), and Dechow et al. (2003), among 
others, suggest that commonly used discretionary accrual proxies (e.g., modified Jones 
model abnormal accruals and forward-looking model abnormal accruals) increase in earn- 
ings. Likewise, for various firms one might anticipate a positive association between de- 
ferred tax expense and firm performance. For example, firms with depreciable assets (e.g., 
manufacturing firms) whose operations are not contracting are likely to maintain a constant 
level of or increase their depreciable assets relative to firms with poor performance. Ceteris 
paribus, new depreciable assets generate a more positive (or less negative) deferred tax 
balance because assets are depreciated more rapidly for tax ccmpared to financial reporting 
purposes. Thus, it is plausible that there may be a positive association between deferred 
tax expense and firm performance.* 

Figure 1 illustrates the comparisons across the actual profit benchmark and pseudo 
profit targets. To estimate the association between discretionary accrual proxies and beating 


3 Prior studies use total accruals (Phillips et al. 2003), deferred tax expense (Phillips et al. 2003), modified Jones 
model abnormal accruals (Payne and Robb 2000; Matsumoto 2002; Phillips et al. 2003), and forward-looking 
model accruals (Dechow et al 2003; Phillips et al. 2003) to investigate earnings management around earnings 
benchmarks 


* Other book-tax differences may lead to a less positive (or more negative) deferred tax balance (e.g., warranty 


expense). Thus, it is an empirical question whether there is a positive association between deferred tax expense 
and firm performance. 
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FIGURE 1 
Pseudo Target and Actual Profit Benchmark Comparisons 
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the actual profit benchmark, we compare discretionary accrual proxies for firms that just- 
beat and just-miss the profit benchmark. In particular, we analyze the associations between 
discretionary accrual proxies and the firm's propensity to reside in the bin to the right of 
the profit benchmark. For the pseudo profit targets, we compare discretionary accrual prox- 
.les for firms that just-beat and just-miss the pseudo profit targets—i.e., we analyze the 
associations between discretionary accrual proxies and the firm's propensity to reside in 
the bin to the right of the pseudo profit targets. 
We estimate the following probit model to test whether alternative discretionary accrual 
measures are positively associated with the probability that a firm meets or beats (1) the 
actual profit benchmark and (2) pseudo profit targets: . 


EM, = a + B,DisAccruals, + B,CashFlows, + Ві» та, + €; (1) 
where: 


ЕМ, = lif X = E, < (X + 0.01) and 0 if (X — 0.01) = Е, < X. We define E, 
as firm i's net income in year ¢ (annual Compustat data item #172) divided 
by the market value of equity at the end of year 1—1 (annual Compustat 
data item #25 X #199), and X as the profit target. X equals 0 for the actual 
profit benchmark;? 

DisAccruals, = one of the following four alternative measures of firm i's discretionary: 
accruals in year f: 


5 We use net income after extraordinary items to be consistent with previous research investigating the propensity 
of firms to beat earnings -enchmarks (Burgstahler and Dichev 1997; Dechow et al. 2003; Phillips et al. 2003). 
Results are similar when we estimate the analyses using net income before extraordinary items (Compustat 
#123). 
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* Total Accruals = total accruals, scaled by total assets (annual Compustat 
#6) at the end of year t—1, is computed as EBEI — Cash Flows, where 
EBEI is income before extraordinary items (annual Compustat data item 
#123), and Cash Flows is cash flows from operations (computed as an- 
nual Compustat data items #308 — #124);5 
Deferred Tax Expense = deferred tax expense (annual Compustat data 
item #50), scaled by total assets (annual Compustat data item #6) at the 
end of year t—1; 
Modified Jones Dis. Accruals = abnormal accruals computed using the 
modified Jones model (Dechow et al. 1995; DeFond and Subramanyam 
1998). We calculate Modified Jones Dis. Accruals as the difference be- 
tween Total Accruals and modified Jones normal accruals. Modified 
Jones model normal accruals are estimated as: Total Accruals, = a 
+ B, (ASales, — AREC,) + В, PPE,, where ASales, equals the change 
in the firm's sales (annual Compustat data item #12) from year t—1 to 
year t, AREC, equals the change in accounts receivable (annual Com- 
pustat data item #302) from year t—1 to year t, and PPE equals the 
firm's gross property, plant, and equipment (annual Compustat data item 
#7). We estimate the Modified Jones model cross-sectionally by year and 
industry for each two-digit SIC industry year with at least ten observa- 
tions. We scale all variables by total assets at the end of year 1—1; 
Fwd-Look Dis. Accruals = abnormal accruals computed using the for- 
ward-looking model (Dechow et al. 2003). We calculate Fwd-Look Dis. 
Accruals as the difference between Total Accruals and forward-looking 
normal accruals. Forward-looking model normal accruals are estimated 
аз: Total Accruals, = а + В, ((1 + k)ASales, - AREC,) + В,РРЕ, 
+ B,Zotal Ассгиайг, | + B,Gr—Sales,,,, where k is the slope coefficient 
from a regression of AREC, on ASales,, (winsorized so that 0 = k = 1) 
Total Accruals, | equals total accruals from year t—1, scaled by total 
assets as of year t—2, and Gr_Sales,,, equals the change in sales from 
year t to 1+1, scaled by year г sales. We estimate the forward-looking 
model cross-sectionally by year and industry for each two-digit SIC in- 
dustry year with at least ten observations; 
Cash Flows, = cash flows from continuing operations (annual Compustat data items #308 
— #124), scaled by total assets as of the end of year t—1; 
Удай, = 1 (0) if firm i is Gs not) in industry j in year 2, based on two-digit SIC 
codes; and 
£, — the error term. 


Following Burgstahler and Dichev (1997), we segregate firms into "profit" bins based 
on the firm's year 1 net income deflated by end of year 1—1 market value of equity. We 
design the pseudo targets and profit bin widths to mitigate the potential that any given firm- 
year observation appears in multiple just-beat or just-miss comparison groups. Each just- 
beat and just-miss bin has a width of .01, and each firm-year observation appears once in 


$ Total accruals are inherently a noisy measure of discretionary accruals. Given that prior research (e.g., Healy 
1985; Phillips et al. 2003) utilizes total accruals as one measure of discretionary accruals, we include data 
regarding how total accruals perform across the various earnings benchmarks using both univariate and regression 
analyses. 
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a just-beat group and once in a just-miss group. We limit our comparisons across pseudo 
targets to firms with deflated earnings between —.10 and .11 to ensure that we have suffi- 
cient observations per bin for our analyses and to restrict our sample to firms with less 
extreme earnings performance.’ Finally, we designate pseudo targets in .01 increments 
beginning at —.09 and ending at .10. Thus, we investigate 19 pseudo profit targets. 

Consistent with prior studies investigating discretionary accruals (e.g., Dechow et al. 
1995; DeFond and Subramanyam 1998; Dechow et al. 2003; Phillips et al. 2003), we deflate 
each discretionary accrual measure by lag total assets. We include cash flows as a control 
for performance (Phillips et al. 2003) or alternatively, to control for the effect that a firm's 
cash flow may have on the firm's need to manage earnings via discretionary accruals. 
Dechow et al. (2003) estimate, based on a linear approximation of the earnings distribution, 
that 85 to 90 percent of firms just beat the profit benchmark without managing earnings. 
Probit analyses controlling for a firm's cash flows should provide more powerful tests of 
earnings management than univariate comparisons that do not control for the firm's need 
to manage earnings via discretionary accruals. Finally, we include two-digit SIC industry 
dummy variables in each probit regression to control for possible differences in the tendency 
to beat earnings targets across industries. 

Ex ante, if the positive associations between discretionary accrual proxies and beating 
the profit benchmark are solely attributable to earnings management to beat the benchmark, 
one would expect significant positive associations between discretionary accrual proxies 
and beating the pseudo profit targets by chance in no more than 10 percent of the com- 
parisons across pseudo targets (i.e., where incentives to beat the targets should not exist). 
This expectation is based upon using a 10 percent significance level to reject the null 
hypothesis. More frequent significant positive associations would be consistent with (1) a 
positive association between discretionary accrual proxies and firm performance (i.e., ir- 
respective of earnings management) and/or (2) firms managing earnings upward across the 
distribution of earnings. Thus, it becomes difficult to distinguish between earnings man- 
agement versus merely an association between discretionary accrual proxies and firm 
performance. 

Implicit in studies analyzing discretionary accruals around earnings benchmarks is the 
expectation that the relationship between firm performance and discretionary accruals shifts 
around benchmarks. Indeed, it is possible that there is an underlying significant positive 
association between discretionary accrual proxies and firm performance (or, alternatively 
that firms manage earnings throughout the earnings distribution), but the association inten- 
sifies around earnings benchmarks where earnings management incentives are stronger. 
Evidence of an increased association between discretionary accrual proxies and firm per- 
formance around the actual profit benchmark would be consistent with firms using discre- 
tionary accruals to beat the benchmark. Accordingly, we test whether the associations be- 
tween discretionary accrual proxies and reporting higher earnings are larger around the 
profit benchmark. We estimate the following probit model: 


7 Firms with deflated earnings between —.10 and .11 represent 59 percent of all firms. Similar restrictions for the 
earnings increase and analysts' forecast benchmark analyses (discussed later) generate samples that represent 66 
percent and 73 percent of all firms, respectively. Results are similar when we expand each sample to include 
firms with more extreme performance (e.g., firms with deflated earnings between —.15 and .16, earnings changes 
between —.15 and .16, and unexpected earnings between negative 15 cents per share and positive 15 cents per 
share). 
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ЕМ, = a, + B,DisAccruals, + B,CashFlows, + Вата, 
+ Actual*(a, + B4DisAccruals, + B,CashFlows, + Віта) + в, (2) 


where Actual = 1 for firm-year observations in the just-miss or just-beat bins around the 
actual profit benchmark, and 0 otherwise. 

All other variables are defined in Equation (1). We estimate two different specifications 
of Equation (2). First, we estimate Equation (2) to determine if the associations between 
discretionary accrual proxies and beating the profit benchmark are significantly greater than 
the average association between the discretionary accrual proxies and beating the pseudo 
targets (i.e., we compare the association between (1) DisAccruals and (2) beating the actual 
profit benchmark to the respective average association for beating the pseudo targets). We 
include the just-beat and just-miss bins for all pseudo targets and the just-beat and just- 
miss bins for the actual profit benchmark in one probit regression. The coefficient for 
Actual*DisAccruals represents the incremental difference in (1) the association between the 
discretionary accrual proxy and beating the actual profit benchmark and (2) the average 
association between the discretionary accrual proxy and beating the pseudo targets. A sig- 
nificant positive coefficient for Actual*DisAccruals would indicate that the association be- 
tween the discretionary accrual proxy and reporting higher earnings becomes more pro- 
nounced around the actual profit benchmark, which would be consistent with earnings 
management to beat the profit benchmark. 

We estimate Equation (2) a second time to determine if the associations between dis- 
cretionary accrual proxies and beating the profit benchmark is significantly greater than the 
associations between the discretionary accrual proxies and beating each pseudo target (i.e., 
we compare the associations between (1) DiscAccruals and (2) beating the actual profit 
benchmark to the respective associations for each pseudo target). We estimate Equation (2) 
once for each specific pseudo profit target. In these analyses, there are four bins included 
in each analysis. We include the just-miss and just-beat bins around the actual profit bench- 
mark and the just-miss and just-beat bins around the pseudo profit target. Similar to above, 
a significant positive coefficient for Actual*DisAccruals would indicate that the association 
between the discretionary accrual proxy and reporting higher earnings becomes more pro- 
nounced around the actual benchmarks, which would be consistent with earnings manage- 
ment to beat the profit benchmark. 


Earnings Increase Benchmark Analyses 

We make three changes to Equations (1) and (2) to investigate the associations between 
discretionary accruals and beating the actual earnings increase benchmark and pseudo earn- 
ings change targets. First, we segregate firms into “earnings change” bins based on the 
firm's year t earnings change deflated by end of year 1—2 market value of equity. Similar 
to the profit benchmark analyses, each just-beat and just-miss bin has a width of .01, and 
each firm-year observation appears once in a just-beat group and once in a just-miss group. 
We investigate 19 pseudo targets consisting of firms with deflated earnings changes between 
—.10 and .11. Second, we redefine ЕМ, to equal 1 if X = AE, < (X + 0.01) and 0 if (X 
- 0.01) = AE, < X. We define AE, as the change in firm i's net income from year 1—1 
to t divided by the market value of equity at the end of year t—2, and X as the earnings 
change target. X equals 0 for the actual earnings increase benchmark. Third, we replace 
CashFlows in Equations (1) and (2) with ACashFlows to control for performance (Phillips 
et al. 2003). We define ACashFlows as the change in firm i's cash flows from continuing 
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operations (annual Compustat data items #308 — #124) from year t— 1 to t, scaled by total 
assets at the end of year 1—1. 


Analysts! Forecast Benchmark Analyses 

We make three modifications to Equations (1) and (2) for the analysts' forecast bench- 
mark analyses. First, we assign firms to “analysts-based unexpected earnings" bins based 
on the firm's unexpected earnings per share (in cents). We define UE, (unexpected earnings) 
as firm i's year t actual earnings per share minus the single-most recent analyst fore- 
cast provided prior to the earnings announcement, both available from the Unadjusted 
I/B/E/S Detail History file. Consistent with prior research (e.g., Degeorge et al. 1999; 
Payne and Thomas 2003; Phillips et al. 2003), we round UE, to the nearest penny. Each 
just-beat and just-miss bin has a width of .01, and each firm-year observation appears once 
in a just-beat group and once in a just-miss group. We investigate 19 pseudo targets in 
these analyses consisting of firms with unexpected earnings between negative ten cents per 
share and positive ten cents per share. Second, we redefine EM, to equal 1 if UE, equals 
X cents per share, and O if UE, equals X less one cent per share. We define X as the 
unexpected earnings target, and X equals 0 for the actual analysts’ forecast benchmark.? 
Third, we replace CashFlows in Equations (1) and (2) with ACashFlows in these analyses 
to control for performance (Phillips et al. 2003). 

We use the single-most recent analyst forecast issued prior to the earnings announce- 
ment from the Unadjusted I/B/E/S Detail History file as our forecast benchmark. O’Brien 
(1988) and Brown (1991) suggest that the single-most recent analyst forecast is more ac- 
curate in predicting actual earnings than the consensus mean forecast. Likewise, Brown and 
Kim (1991) find that the single-most recent analyst forecast more accurately reflects the 
market's earnings expectation than the consensus mean forecast. Assuming that firms intend 
to meet or beat market expectations, using a more current forecast proxy should provide a 
more powerful test of whether firms use discretionary accruals to "meet or beat analyst 
forecasts. This may be especially important since recent research suggests that firms “walk 
down" analyst forecasts during the sample period (Richardson et al. 2004). 


Ш. SAMPLE SELECTION 

We consider four discretionary accrual measures in our tests, including total accruals, 
deferred tax expense, modified Jones model abnormal accruals, and forward-looking model 
abnormal accruals. To facilitate a consistent measure of deferred tax expense across our 
sample period, we restrict the sample to post-SFAS No. 109 deferred tax expense measures 
(1.е., firm-years from 1994-2002). We exclude firms not incorporated in the U.S., mutual 
funds (SIC code 6726), trusts (SIC code 6792), REITS (SIC code 6798), limited partner- 
ships (SIC code 6799), and other flow-through entities (SIC code 6795) because these firms 
are either primarily subject to taxation outside the U.S. or do not account for income tax 
expense. We exclude utilities (SIC codes 4900—4999) and financial institutions (SIC codes 
6000—6099) as regulated firms likely face different earnings management incentives than 
nonregulated firms. To mitigate the effect of extreme observations, we delete firm-years 
with a deferred tax expense in the top or bottom one percentile in each comparison or with 
a scaled total accrual, modified Jones model abnormal accruals, or forward-looking model 


5 Results are similar when we (1) designate firms with UE equal to one cent per share as the just-beat group and 
eliminate sample firms with UE equal to zero cents per share or (2) designate firms with UE equal to zero or 
one cent per share as the just-beat group. 
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abnormal accruals greater than 100 percent (in absolute value) of lagged total assets 
(DeFond and Subramanyam 1998; Dechow et al. 2003; Phillips et al. 2003). Finally, each 
firm-year observation must have sufficient information available to calculate the requisite 
variables in the probit analysis. 


IV. DESCRIPTIVE STATISTICS AND RESULTS 
Profit Benchmark Analyses 


Table 1, Panel A presents the sample size for each profit bin and the mean values of 
each discretionary accrual measure. In each row, we test whether the mean values of the 
discretionary accrual measures for that particular profit bin is significantly higher (i.e., more 
positive or less negative) than the mean values of the discretionary accrual measures in the 
preceding row. We denote significant differences (p = .10, one-tailed test) with bold text 
and identify the "actual" just-beat and just-miss groups with "boxed" text. To provide 
additional evidence regarding the potential positive correlation between discretionary ac- 
crual proxies and firm performance, the last row of Panel A reports the correlation between 
the mean values of discretionary accrual measures for each bin and the bin number, which 
ranges from 1 (the most negative bin) to 21 (the most positive bin). Table 1, Panel B 
presents the Pearson correlations for each variable (i.e., not the mean values per bin). 

The profit benchmark sample includes 22,903 firm-year observations when Total Ac- 
cruals, Deferred Tax Expense, or Modified Jones Dis. Accruals proxies for discretionary 
accruals and 19,045 firm-year observations when Fwd-Look Dis. Accruals proxies for dis- 
cretionary accruals. The Fwd-Look Dis. Accruals sample is smaller because estimating Fwd- 
Look Dis. Accruals requires two additional years of data (i.e., to estimate Total Accruals,, , 
and Gr_Sales,,,). 1,180 observations reside in the actual just-beat bin (ie. 00 = E, 
« .01) and 683 reside in the actual just-miss bin (ie., —.01 = E, « .00) when Total 
Accruals, Deferred Tax Expense, or Modified Jones Dis. Accruals proxies for discretionary 
accruals. 963 observations reside in the actual just-beat bin and 548 reside in the actual 
just-miss bin when Fwd-Look Dis. Accruals proxies for discretionary accruals. Similar to 
prior research, when we graph the frequency of bin observations for the profit benchmark 
sample, we observe a “kink” in the distributions to the left of the zero profit bin. 

Panel A of Table 1 indicates that Total Accruals, Deferred Tax Expense, and Fwd-Look 
Dis. Accruals are significantly larger (i.e., more positive or less negative at р = .10, one- 
tailed test) in the .00 = E, < .01 bin (i.e., the just-beat bin) relative to the —.01 = E, 
< .00 bin (i.e., the just-miss bin).? Nonetheless, similar results hold for five of the 19 other 
comparisons for Total Accruals, seven of the 19 other comparisons for Deferred Tax Ex- 
pense, and four of the other 19 comparisons for Fwd-Look Dis. Accruals (p = .10, one- 
tailed test). These univariate comparisons suggest that the positive associations between 
discretionary accrual measures and earnings extend to other earnings bins not centered on 
the zero profit benchmark. We also note, as might be expected, that CashFlows increases 
in earnings. In ten of 20 comparisons, the mean value of CashFlows is statistically larger 
(more positive or less negative at p = .10) than its value in the preceding bin. 


? Prior research (Dechow et al. 2000, 2003; Phillips et al. 2003) report similar results for univariate comparisons 
of Total Accruals, Deferred Tax Expense, and Modified Jones Dis. Accruals for just-beat and just-miss firms 
around the actual profit benchmark. For Fwd-Look Dis. Accruals, both Dechow et al. (2003) and we report a 
significantly larger amount for just-beat firms (p = .07 and p = .01, respectively, one-tailed test), while Phillips 

. et al. (2003) report no significant difference. When we limit our sample to the period investigated in Dechow 
et al. (2003), we find a significantly larger Fwd-Look Dis. Accruals for just-beat firms (p = .06, one-tailed test). 
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The last row of Table 1, Panel A indicates that the mean values of the discretionary 
accrual proxies are highly positively correlated with the earnings bin number. The Pearson 
correlation coefficients for Total Accruals, Deferred Tax Expense, Modified Jones Dis. Ac- 
cruals, and Fwd-Look Dis. Accruals are .97, .92, .90, and .89, respectively, and each is 
statistically significant (p = .00). Panel B presents the Pearson correlation coefficients for 
each variable. Total Accruals, Deferred Tax Expense, Modified Jones Dis. Accruals, and 
Fwd-Look Dis. Accruals are each positively correlated with earnings (Е,) (р = .10, two- 
tailed test). In sum, univariate analyses suggest that the associations between discretionary 
accrual proxies and reporting higher earnings may not be unique to comparisons centered 
on the profit benchmark. 


Comparisons across Pseudo Targets 

Table 1, Panel C reports the results of the pseudo target and actual benchmark analy- 
ses.!° The fourth through seventh columns present the coefficients for the alternative dis- 
cretionary accrual measures. In the last column we present the coefficients for Cash Flows 
from the regression including Fwd-Look Dis. Accruals as the discretionary accrual proxy. 
The pattern of significant coefficients for Cash Flows is similar when we use Total Accruals, 
Deferred Tax Expense, or Modified Jones Dis. Accruals аѕ Ње discretionary accrual proxy. 
Each row represents the results for estimating Equation (1) for a different just-beat and 
just-miss sample. We highlight positive and significant coefficients (p = .10, one-tailed test) 
with bold text and identify the actual just-beat and just-miss groups with boxed text. In 
the last two rows we tally the number of significant positive coefficients (k) for the pseudo 
target regressions for each discretionary accrual measure (p = .10, one-tailed test) and 
present the probability of “drawing” К or more significant positive coefficients assuming a 
binomial distribution and 10 percent likelihood of drawing a significant positive coefficient 
by chance. 

We find in Panel С of Table 1 that Total Accruals, Deferred Tax Expense, Modified 
Jones Dis. Accruals, and Fwd-Look Dis. Accruals are each positively associated with the 
probability of beating the actual profit benchmark (p = .01, one-tailed test). Results suggest, 
however, that the significant positive associations between these discretionary accrual prox- 
ies and firm profits exist for more tban the expected number of comparisons of adjacent 
bins segregated based on pseudo profit targets. Of the 19 comparisons across adjacent bins 
segregated based on pseudo profit targets, we find a significant positive association (p 
= .10, one-tailed test) between beating the pseudo target and (1) Total Accruals in: 12 
comparisons (63 percent), (2) Deferred Tax Expense in six comparisons (32 percent), (3) 
Modified Jones Dis. Accruals in seven comparisons (37 percent), and (4) Fwd-Look Dis. 
Accruals in ten comparisons (52 percent). Assuming a binomial distribution and 10 percent 
likelihood of drawing a significant positive coefficient by chance, the frequency of positive 
significant associations for each discretionary accrual measure is significantly higher than 
expected (p = .01, one tailed test). Accordingly, it is difficult to distinguish whether the 
associations between Total Accruals, Deferred Tax Expense, Modified Jones Dis. Accruals, 
and Fwd-Look Dis. Accruals and beating the actual profit benchmark are attributable to 
earnings management or the associations between these discretionary accrual proxies and 
firm performance. Finally, as might be expected, Panel C indicates that CashFlows is pos- 
itively related to beating the actual profit benchmark and is also positively associated with 
beating ten of the 19 pseudo profit targets (p = .10, one tailed test). 


1? Our probit regression results comparing just-beat and just-miss firms around the actual profit benchmark are 
similar to the probit results reported in Phillips et al. (2003). 
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Tests for Shifts in the Association between Discretionary Accrual Measures and 
Firm Performance around the Profit Benchmark 

We estimate Equation (2) to test whether the associations between discretionary accrual 
proxies and firm performance increase around the actual profit benchmark (i.e., where 
earnings management incentives may be stronger). In Table 1, Panel C, we identify with 
superscripts c and d those regression coefficients from: the pseudo target analyses that are 
significantly smaller (р = 05 and p = .10, one-tailed test, respectively) than the estimated 
coefficient from the actual profit benchmark (i.e., significantly smaller than the coefficients 
highlighted in boxed text). 

In Table 1, Panel D, we report the results when we compare the associations between 
discretionary accrual proxies and beating the actual profit benchmark to the average as- 
sociation between the discretionary accrual proxies and beating the pseudo targets. Com- 
paring the coefficients for DisAccruals estimated for the actual benchmark and pseudo 
targets via Actual*DisAccruals in Equation (2) is similar to comparing the marginal ef- 
fects of the discretionary accrual measures for the actual benchmark and pseudo targets. 
In particular, comparisons of the marginal effects for DisAccruals for the actual bench- 
marks and pseudo targets generate the same sign and statistical significance as Ac- 
tual*DisAccruals. Likewise, the ratio of one coefficient (e.g., DisAccruals) to another (e.g., 
CashFlows) is similar in magnitude to the ratio of their marginal effects, which allows one 
to compare the relative marginal effects of each variable. 

Consistent with the relationship between discretionary accrual proxies and firm per- 
formance increasing around the profit benchmark, Table 1, Panel D reports that the coef- 
ficients for Total Accruals, Deferred Tax Expense, Modified Jones Dis. Accruals, and Fwd- 
Look Dis. Accruals around the actual profit benchmark are significantly larger than the 
average association between the respective discretionary accrual measures and beating 
the pseudo profit targets. In particular, the coefficients for Actual*Total Accruals, Ac- 
tual*Deferred Tax Expense, Actual*Modified Jones Dis. Accruals, and Actual*Fwd-Look 
Dis. Accruals are positive and significant (р = .00, .00, .06, and .00, respectively, one- 
tailed test). 

In addition, Table 1, Panel C reports that the coefficients for -Total Accruals, Deferred 
Tax Expense, and Fwd-Look Dis. Accruals estimated around the actual profit benchmark 
are significantly larger than the majority of the estimated coefficients from the pseudo profit 
target comparisons. In particular, the estimated coefficients around the profit benchmark are 
significantly larger than 19 of the 19 coefficients for Total Accruals for the pseudo targets, 
16 of the 19 coefficients for Deferred Tax Expense for the pseudo targets, and 17 of the 
19 coefficients for Fwd-Look Dis. Accruals for the pseudo targets (p < .10, one-tailed test). 
These frequencies are significantly higher than expected (p = .00, one-tailed test) assuming 
a binomial distribution and 10 percent likelihood that the actual benchmark coefficient is 
significantly larger than a pseudo target coefficient by chance. We find that the estimated 
coefficient for Modified Jones Dis. Accruals for the profit benchmark is significantly larger 
(p < .10, one-tailed test) than four of the 19 coefficients for Modified Jones Dis. Accruals 
for the pseudo targets. 39 

In sum, results suggest that the positive associations between discretionary accrual 
proxies and firm profits intensify around the actual profit benchmark. These findings are 
consistent with the earnings management hypothesis—i.e., the association between discre- 
tionary accrual proxies and firm performance shifts in those settings where firms have 
greater incentives to manage earnings. However, the associations between CashFlows and 
firm profits generally also increase around the actual profit benchmark. Specifically, Table 
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1, Panel D reports that the association between CashFlows and beating the actual profit 
benchmark is significantly larger (p = .05, one-tailed test) than the average association 
between CashFlows and beating the pseudo profit targets in three of the four specifications 
of Equation (2). Likewise, Table 1, Panel C indicates that the coefficient for CashFlows 
from the regression estimated around the actual profit benchmark is significantly larger than 
most of the coefficients for CashFlows for the pseudo earnings targets. 

There are at least three possible explanations for the increased association between 
CashFlows and earnings around the actual profit benchmark. First, it is possible that firms 
manage both cash flows and discretionary accruals around the profit benchmark (Burgstahler 
and Dichev 1997). Indeed, evidence reported by Roychowdhury (2003) suggests that firms 
manage cash flows from operations to avoid reporting losses. Second, it is possible that 
cash flows from operations better represent firm performance around the profit benchmark. 
For example, the association between cash flows from operations and net income may 
increase around the profit benchmark if firms are less likely to report extraordinary items 
around the profit benchmark. We examined the distribution cf extraordinary items for our 
sample and note that extraordinary items are more pronounced at the tails of the earnings 
distributions. In sensitivity analysis, we re-estimated our probit regressions after limiting 
the sample to firms that do not report extraordinary items. Results are similar to those 
reported in Table 1, Panels C and D for our discretionary accrual measures. In addition, 
the coefficients for CashFlows remain more pronounced around the profit benchmark. 
Third, as suggested in Dechow et al. (2003), it is possible that managers and employees 
may simply work harder around earnings benchmarks. If this conjecture is descriptive, then 
it could, in part or in aggregate, explain the unique association between discretionary ac- 
cruals and beating the profit benchmark. Accordingly, we are unable to conclude with 
confidence that earnings management explains this association. 


Earnings Increase Benchmark Analyses 

Table 2, Panels A and B, report univariate analyses for the earnings increase benchmark. 
In each row of Panel A, we test whether the mean values of the discretionary accrual 
measures for that particular earnings change bin is significantly higher (i.e., more positive 
or less negative) than the mean values of the discretionary accrual measures in the preceding 
row. Panel A indicates that both Total Accruals and Deferred Tax Expense are significantly 
larger (i.e., more positive or less negative at р = .10, one-tailed test) in the .00 = AE, 
< .01 bin (і.е., the actual just-beat bin) relative to the —.01 = AE, < .00 bin (i.e., the 
actual just-miss bin).'' However, these same relations hold for seven of the 19 other com- 
parisons for Total Accruals and three of the 19 other comparisons for Deferred Tax Expense 
(p = .10, one-tailed test). 

The last row of Table 2, Panel A indicates that the mean values of the discretionary 
accruals are highly positively correlated with the earnings change bin number. The Pearson 
correlation coefficients for Total Accruals, Deferred Tax Expense, Modified Jones Dis. Ac- 
cruals, and Fwd-Look Dis. Accruals are .91, .83, .85, and .94, respectively, and each is 
statistically significant (p = .00). Panel B presents the Pearson correlation coefficients for 
each variable. Total Accruals, Deferred Tax Expense, Modified Jones Dis. Accruals, and 
Fwd-Look Dis. Accruals are each positively correlated with earnings changes (AE,) (p 
= 10, two-tailed test). In sum, univariate analyses suggest that the associations between 


п Phillips et al. (2003) report similar results for univariate comparisons of Total Accruals, Deferred Tax Expense, 
Modified Jones Dis. Accruals, and Fwd-Look Dis. Accruals for just-beat and just-miss firms around the actual 
earnings increase benchmark. 
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discretionary accrual proxies and reporting more positive earnings changes may not be 
unique to comparisons centered on the earnings increase benchmark. 


Comparisons across Pseudo Targets — - | 

Table 2, Panel С reports the results of the pseudo target and actual benchmark апају- 
505.12 Each row represents the results for estimating Equation (1) for a different just-beat 
and just-miss sample. We highlight positive and significant coefficients (p = .10, one-tailed 
test) with bold text and identify the actual just-beat and just-miss groups with boxed text. 
In the last two rows we tally the number of significant positive coefficients (k) for the 
pseudo target regressions for each discretionary accrual measure (р = .10, one-tailed test) 
and present the probability of “drawing” k or more significant positive coefficients assum- 
ing a binomial distribution and 10 percent likelihood of drawing a significant positive 
coefficient by chance. Panel C reports significant positive associations between (1) beating 
the actual earnings increase benchmark and (2) Total Accruals, Deferred Tax Expense, and 
Fwd-Look Dis. Accruals (p = .01, one-tailed test). The coefficient for Modified Jones Dis. 
Accruals is positive but not statistically significant (p = .25, one-tailed test). 

Results for the pseudo earnings change target analysis are similar to those for the 
pseudo profit target analysis, albeit less pronounced. Of the 19 comparisons across adjacent 
earnings change bins, segregated based on pseudo earnings change targets, we find a sig- 
nificant positive association (p = .10, one-tailed test) between beating the pseudo target 
and (1) Total Accruals in nine comparisons (47 percent), (2) Deferred Tax Expense in five 
comparisons (26 percent), (3) Modified Jones Dis. Accruals in four comparisons (21 per- 
cent), and (4) Fwd-Look Dis. Accruals in eight comparisons (42 percent). These frequencies 
of significant positive coefficients are significantly higher than expected by chance at p 
= .00, p = .04, and p = .00 (one-tailed test) for Total Accruals, Deferred Tax Expense, 
and Fwd-Look Dis. Accruals, respectively, and at p = .12 (one-tailed test) for Modified 
Jones Dis. Accruals. Thus, similar to the profit benchmark analysis, it is difficult to distin- 
guish whether the associations between Total Accruals, Deferred Tax Expense, and Fwd- 
Look Dis. Accruals and beating the earnings increase benchmark are attributable to earnings 
management or the associations between these discretionary accruals proxies and earnings 
changes. Finally, Table 2, Panel C indicates that ACashFlows is positively related to beating 
the earnings increase benchmark and nine of 19 pseudo earnings change targets (p = .10, 
one-tailed test). 


Tests for Shifts in the Association between Discretionary Accrual Measures and 
Firm Performance around the Earnings Increase Benchmark 

We estimate Equation (2) to test whether the associations between discretionary accrual 
proxies and firm performance increase around the actual earnings increase benchmark (i.e., 
where earnings management incentives may be stronger). In Table 2, Panel C, we identify 
with superscripts “с” and “d” those regression coefficients from the pseudo target analyses 
that are significantly smaller (p = .05 and p = .10, one-tailed test, respectively) than the 
estimated coefficient from the actual earnings increase benchmark (i.e., significantly smaller 
than the coefficients highlighted in boxed text). In Table 2, Panel D, we report the results 
when we compare the associations between discretionary accrual proxies and beating the 
actual earnings increase benchmark to the average association between the discretionary 
accrual proxies and beating the pseudo targets. 


12 Ош probit regression results comparing just-beat and just-miss firms around the actual earnings increase bench- 
mark are similar to the probit results reported in Phillips et al. (2003). 
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Consistent with the relationship between discretionary accrual proxies and firm per- 
formance becoming more pronounced around the earnings increase benchmark, Table 2, 
Panel D reports that the regression coefficients for Total Accruals and Deferred Tax Expense 
estimated around the actual earnings increase benchmark are significantly larger (p = .00 
and .03, respectively, one-tailed test) than the average association between the respective 
discretionary accrual measures and beating the pseudo earnings change targets. The differ- 
ence between the coefficient for Fwd-Look Dis. Accruals estimated around the earnings 
increase benchmark and the average association between Fwd-Look Dis. Accruals and beat- 
ing the pseudo earnings change targets is positive as expected but not statistically significant 
(p = .13, one-tailed test). 

Table 2, Panel C reports that the coefficients for Total Accruals, Deferred Tax Expense, 
and Fwd-Look Dis. Accruals from the regressions estimated around the actual earnings 
increase benchmark are significantly larger than many of the estimated coefficients from 
the pseudo profit target comparisons. In particular, the estimated coefficients around the 
earnings increase benchmark are significantly larger than 18 of the 19 coefficients for Total 
Accruals for the pseudo targets, 12 of the 19 coefficients for Deferred Tax Expense for the 
pseudo targets, and eight of the 19 coefficients for Fwd-Look Dis. Accruals for the pseudo 
targets (р < .10, one-tailed test). These frequencies are significantly higher than expected 
(p = .00, one-tailed test) assuming a binomial distribution and 10 percent likelihood that 
the actual benchmark coefficient is significantly larger than a pseudo target coefficient Бу , 
chance. 

The associations between ACashFlows and reporting more positive earnings changes 
generally also increase around the actual earnings increase benchmark. Specifically, Table 
2, Panel D reports that the association between ACashFlows and beating the actual earnings 
increase benchmark is significantly larger (p = =.00, one-tailed test) than the average 
association between ACashFlows and beating the pseudo earnings change targets in all four 
specifications of Equation (2). Likewise, Table 2, Panel C indicates that the estimated 
coefficient for ACashFlows around the actual earnings increase benchmark is significantly 
larger than each of the coefficients for ACashFlows for the pseudo earnings change targets. 
As discussed above, these results are consistent with firms managing both cash flows and 
discretionary accruals around the earnings increase benchmark, or managers and employees 
working harder around the earnings increase benchmark. Accordingly, we are unable to 
conclude with confidence that earnings management explains the associations between dis- 
cretionary accruals and beating the earnings increase benchmark. 


Analysts’ Forecast Benchmark Analyses 

Table 3, Panels A and B report the univariate analyses for the analysts’ forecast bench- 
mark. Panel A indicates that Total Accruals, Modified Jones Dis. Accruals, and Fwd-Look 
Dis. Accruals are significantly larger (more positive or less negative) in the actual just-beat 
bin relative to the actual just-miss bin.? These same relations hold for only one of the 19 
other comparisons for Modified Jones Dis. Accruals and none of the 19 other comparisons 
for Total Accruals and Fwd-Look Dis. Accruals (p = .10, one-tailed test). Unlike the results 
for the profit and earnings increase benchmark analyses, there appears to be little positive 


13 Using the last mean consensus analyst forecast from the Adjusted I/B/E/S Summary file as the analysts’ forecast 
benchmark, prior research (Dechow et al. 2000; Phillips et al. 2003) reports no significant differences in uni- 
variate comparisons of Total Accruals, Deferred Tax Expense, Modified Jones Dis. Accruals, and Fwd-Look Dis. 
Accruals for just-beat and just-miss firms around the actual analysts’ forecast benchmark. We find similar results 
to prior research when we use the last mean consensus analyst forecast from the Adjusted I/B/E/S Summary 
file as the analysts’ forecast benchmark. 
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correlation between the mean values of the discretionary accrual measures and the unex- 
pected earnings bins. In particular, the Pearson correlation coefficients for Total Accruals, 
Deferred Tax Expense, Modified Jones Dis. Accruals, and Fwd-Look Dis. Accruals are .09, 
.26, —.09, and —.36, respectively. None of the correlations are statistically significant. 
Likewise, Panel B reports that none of four discretionary accrual proxies have a significant 
positive correlation with UE. 


Comparisons across Pseudo Targets 

Table 3, Panel C reports a significant positive association between (1) beating the actual 
analysts’ forecast benchmark and (2) Total Accruals, Modified Jones Dis. Accruals, and 
Fwd-Look Dis. Accruals (p = .06, .04, and .04, respectively, one-tailed test).'^ We find a 
positive but insignificant association for Deferred Tax Expense (p = .29, one-tailed test). 
In addition, we find few positive associations between the discretionary accrual measures 
and beating the pseudo analysts-based unexpected earnings targets. Specifically, the fre- 
quency of significant positive coefficients for each discretionary accrual measures is not 
significantly different from what would be expected by chance. Finally, Panel C indicates 
that ACashFlows is not significantly associated with beating the analysts’ forecast bench- 
mark and is only significantly positively related to beating one of the 19 pseudo unexpected 
earnings targets (р = .10, one-tailed test).!° 


Tests for Shifts in the Association between Discretionary Accrual Measures and 
Firm Performance around the Analysts’ Forecast Benchmark 

Because we find little evidence of a positive association between discretionary accrual 
measures and beating pseudo targets derived from analysts-based unexpected earnings, the 
comparisons of the coefficients for Total Accruals, Modified Jones Dis. Accruals, and Fwd- 
Look Dis. Accruals across the actual analysts’ forecast benchmark and pseudo targets are 
less informative. Nonetheless, Table 3, Panel D reports that the coefficients for Total Ac- 
cruals, Modified Jones Dis. Accruals, and Fwd-Look Dis. Accruals estimated around the 
actual analysts’ forecast benchmark are significantly larger (р = .07, .03, and .04, respec- 
tively, one-tailed test) than the average association between the respective discretionary 
accrual measures and beating the pseudo analysts’ forecast targets. Likewise, Panel C re- 
ports that the coefficients for Total Accruals, Modified Jones Dis. Accruals, and Fwd-Look 
Dis. Accruals estimated around the actual analysts’ forecast benchmark are significantly 
larger than many of the estimated coefficients from the pseudo profit target comparisons. 
Finally, unlike the profit and earnings increase benchmark analyses, we find no associations 
between ACashFlows and beating the analysts’ forecast benchmark. Accordingly, we are 
able to draw stronger conclusions regarding earnings management around the analysts’ 
forecast benchmark. 


14 Phillips et al. (2003) uses the last mean consensus analyst forecast from the Adjusted I/B/E/S Summary file 
as the analysts’ forecast benchmark. When we use the last mean consensus analyst forecast from the Adjusted 
1/В/В/5 Summary file, probit results for the actual analysts’ forecast benchmark are similar to those reported 
in Phillips et al. (2003). 

15 In sensitivity analysis, we compared the price-to-earnings (PE) multiples and book-to-market (BM) ratios of 
firms that just beat the analysts’ forecast benchmark to other sample firms to determine if the just-beat firms 
are more likely to represent growth firms. We find no significant difference in PE multiples or book-to-market 
ratios for firms that just beat the analysts’ forecast benchmark (i.e., UE = .00) compared to those firms that just 
miss the benchmark (UE = —.01). 
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Alternative Analysts’ Forecast Specifications 


We use the single-most recent forecast from the Unadjusted I/B/E/S Detail History 
file as the forecast proxy in our main analyses. O’Brien (1988) and Brown (1991) suggest 
that the single-most recent analyst forecast is more accurate in predicting actual earnings 
than the consensus mean forecast. Likewise, Brown and Kim (1991) find that the single- 
most recent analyst forecast more accurately reflects the market’s earnings expectation than 
the consensus mean forecast (i.e., the average of all outstanding forecasts). I/B/E/S re- 
cently released the Unadjusted Detail History file that includes analysts’ forecasts that have’ 
not been adjusted for subsequent stock splits. Previously, I/B/E/S only provided earnings 
forecasts adjusted for subsequent stock splits, which prior research indicates may result in 
classification errors due to rounding (Baber and Kang 2002; Payne and Thomas 2003). 

In sensitivity analysis, we re-estimate our probit regressions to determine whether re- 
sults are robust to other forecast measures or are sensitive to using data from the Unadjusted 
or Adjusted I/B/E/S files. We first re-estimate our analyses using data from the Adjusted 
I/B/E/S Detail History file (1) including all sample observations, (2) eliminating firms with 
a stock split factor exceeding 3, and (3) eliminating firms with a stock split factor exceeding 
1. Results are consistent with those presented in Table 3, Panels C and D for each alternative 
sample. 

We also re-estimate our analyses using the last mean consensus analyst forecast from 
the Unadjusted and Adjusted I/B/E/S Summary files. Results weaken when we use the 
last mean consensus analyst forecast from the Unadjusted file and disappear when we use 
the last mean consensus analyst forecast from the Adjusted file. Specifically, using data 
from the Unadjusted I/B/E/S Summary file, we find that only two of the abnormal accrual 
measures (i.e., Modified Jones Dis. Accruals and Fwd-Look Dis. Accruals) are statistically 
significant (p — .09, one-tailed test). When we use the last mean consensus forecast reported 
by the Adjusted I/B/E/S Summary file (1) including all sample observations, (2) elimi- 
nating firms with a stock split factor exceeding 3, and (3) eliminating firms with a stock 
split factor exceeding 1, we find that none of the discretionary accrual measures are posi- 
tively associated with beating the analysts' forecast benchmark. These results suggest that 
the analysts’ forecast results weaken when we use a more *'stale" forecast proxy. Assuming 
that firms intend to meet or beat market expectations, one would expect that results i improve 
when we use a forecast proxy that better represents these expectations. 


V. SUPPLEMENTAL ANALYSES 
ROA Performance-Matching 

Kothari et al. (2005) provide evidence that “performance matching is critical to de- 
signing well-specified tests of earnings management.” In particular, they find that ROA 
performance-matched accrual measures mitigate Type I research errors when the partition- 
ing variable is correlated with performance. One advantage of our analysis relative to 
performance-matching is that it does not require specifying the exact nature of the corre- 
lation between discretionary accruals and performance—i.e., it does not require specifying 
a metric to match upon (e.g., ROA). Accordingly, relative to performance-matching this 
analysis may result in more powerful tests of earnings management around earnings bench- 
marks as it does not introduce “noise”’ in the tests via the matching process. 

To compare our results to those from performance-matching, we re-estimate our probit 
regressions using ROA performance-matched discretionary accrual measures. For the earn- 
ings change and analysts-based unexpected earnings analyses, we match firms based on 
their current ROA and two-digit industry as suggested by Kothari et al. (2005). Because 
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comparisons based on firm profits directly relate to current ROA, we match firms on lagged 
ROA and two-digit SIC industry for the profit target analyses.'© Kothari et al. (2005) find 
that matching on current or lagged ROA results in better-specified tests than discretionary 
accrual measures that are not performance-matched, although matching on current ROA 
generally performs better than lagged ROA matching. We then re-estimate our probit models 
using discretionary accrual measures adjusted by the matched firm’s discretionary accruals. 

Table 4 presents the results including Fwd-Look Dis. Accruals as the discretionary 
accrual proxy in the profit, earnings increase, and analysts’ forecast benchmark analyses. 
Results are generally much weaker than those reported in Panel C of Tables 1, 2, and 3. 
In particular, we find that Fwd-Look Dis. Accruals is no longer significantly associated with 
beating the actual profit or analysts’ forecast benchmark. Conclusions are similar when we 
use Total Accruals, Deferred Tax Expense, or Modified Jones Dis. Accruals to proxy for 
discretionary accruals—i.e., we find little evidence of significant positive associations be- 
tween performance-matched discretionary accrual proxies and beating the profit or analysts’ 
forecast benchmark. 

There are two plausible explanations for the considerably weaker results for perform- 
ance-matched discretionary accruals: (1) performance-matching reduces the power of sta- 
tistical tests of earnings management around earnings benchmarks, and (2) the previous 
associations we find between discretionary accrual measures and beating the profit and 
analysts’ forecast benchmarks are attributable to an underlying association between discre- 
tionary accrual proxies and firm performance that is unrelated to earnings management. 
Our previous analyses indicate that discretionary accrual proxies are not systematically 
associated with reporting more positive analysts-based unexpected earnings. On balance, 
therefore, we interpret explanation (1) to be more descriptive for the performance-matching 
results for the analysts’ forecast benchmark. Given our prior evidence of a systematic pos- 
itive association between discretionary accrual proxies and reporting higher earnings, we 
are unable to distinguish explanations (1) and (2) for the performance-matching results for 
the profit benchmark. 


Alternative Explanations for the Profit and Earnings Change Kinks 

Recent studies provide alternative explanations (i.e., besides earnings management) for 
the kinks in the earnings and earnings changes distributions around the profit and earnings 
increase benchmarks. Dechow et al. (2003) and Durtschi and Easton (2005) demonstrate 
that the kinks in the earnings and earnings change distributions disappear when earnings 
and earnings changes are deflated by shares outstanding instead of market value. In contrast, 
Beaver et al. (2003) contend that much of the kink in the earnings and earnings changes 
distributions are attributable to factors not related to earnings management, such as the 
asymmetric treatment of income taxes and special items for loss firms versus profitable 
firms. 

We conduct additional analyses to control for the effects that the asymmetric treatment 
of income taxes and special items for loss and profits may have on our inferences. Specif- 
ically, we re-estimate the profit, earnings increase, and analysts’ forecast benchmark anal- 
yses after including two additional control variables, special items deflated by lagged assets 
and current tax expense deflated by lagged assets. Results are similar to those presented in 
the tables. In sum, we find no evidence that differences in the asymmetric treatment of 


16 Consistent with Kothari et al. (2005), we match sample firms to the firm in their two-digit industry with the 
closest current or lagged ROA. In addition, we require that the matched firm resides in a bin other than the just- 
beat or just-miss bin for the specific comparison. 
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special items and income taxes explain the previous associations between discretionary 
accruals and beating the profit, earnings increase, and analysts’ forecast benchmarks. 


AAER Analyses 

We collected a sample of firms subject to SEC enforcement action for overstating 
earnings to determine if these firms are more likely to reside in the just-beat bins around 
the actual earnings benchmarks. Following the method in Dechow et al. (1995, 1996), we 
conducted a keyword search for “Section 13(a)" in the Accounting and Auditing Enforce- 
ment Releases (AAERs) Database through Lexis-Nexis. We searched for AAERs issued 
between 1994 and November 2004 with an identified manipulation between fiscal year 
1994 and 2002 to match our sample period. We eliminated releases involving violation of 
auditing standards, multiple releases involving the same firms, and firms not listed on 
Compustat. The final sample consisted of 392 firm-years subject to enforcement actions by 
the SEC for overstating earnings between 1994 and 2002. 

We then plotted the percentage of AAER firm-years (i.e., the violation years) in each 
"bin" of the distribution of earnings, earnings changes, and analysts-based unexpected 
earnings against the relative percentages for this study's sample of firms (i.e., from Panel 
A of Tables 1, 2 and 3). The percentage of AAER firms in the UE = 0 bin for the analysts’ 
forecast benchmark (39.7 percent) (not tabulated) is significantly higher (p — .00, one-tailed 
binomial test) than the underlying population of firms that reside in this bin (15.0 percent). 
Less pronounced differences exist for the profit benchmark (i.e., 6.9 percent of AAER firms 
reside in the .00 = E, < .01. bin versus 5.2 percent for all firms, p = .14, one-tailed 
binomial test). For the earnings increase benchmark, the percentages of AAER firms in the 
.00 = AE, < .01 bin (8.8 percent) is smaller than the underlying population of firms that 
reside in this bin (13.0 percent). Similar to previous analyses, these results allow stronger 
conclusions regarding earnings management around the analysts' forecast benchmark. This 
evidence is also consistent with recent studies that conclude that the analysts' forecast 
benchmark is the most important benchmark during the sample period (Dechow et al. 2003; 
Brown and Caylor 2005). 


VI. CONCLUSION 

This study investigates the extent to which the positive associations between alternative 
discretionary accrual measures and beating benchmarks are attributable to earnings man- 
agement. We find that the positive associations between discretionary accrual measures and 
beating the profit benchmark bold for comparisons of groups segregated based on pseudo 
profit targets. We find similar results for pseudo targets across the earnings change distri- 
bution. In contrast, we find that the positive associations between discretionary accruals 
and beating the analysts' forecast benchmark generally do not extend to comparisons across 
pseudo targets based upon analysts-based unexpected earnings. These results suggest that 
demonstrating positive associations between a specific discretionary accrual measure and 
beating the profit and earnings increase benchmark is not sufficient to conclude that the 
discretionary accrual actually detects earnings management. Likewise, this evidence indi- 
cates that the underlying association between firm performance and discretionary accrual 
measures is less problematic for tests of earnings management using analysts-based unex- 
pected earnings. | 

We provide an additional analysis that specifically accounts for the systematic associ- 
ation between discretionary accrual measures and earnings. Implicit in studies analyzing 
accruals around earnings benchmarks is the expectation that the fundamental relationship 
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between firm performance and discretionary accruals shifts around the earnings bench- 
marks. Accordingly, we test whether the associations between discretionary accrual proxies 
and firm performance are more pronounced around the profit benchmark. We conduct sim- 
ilar analyses for the actual earnings increase benchmark. Consistent with earnings man- 
agement, results suggest that the association between discretionary accruals and earnings 
intensifies around the profit benchmark. We find similar effects around the actual earnings 
increase benchmark. However, analogous patterns exist for cash flows around the profit and 
earnings increase benchmarks. Thus, we are unable to conclude with confidence that earn- 
ings management explains the associations between discretionary accruals and beating the 
profit and earnings increase benchmarks. 

One advantage of this study’s analysis relative to performance-matched discretionary 
accruals (Kothari et al. 2005) is that it does not require specifying a metric to match upon 
(e.g., ROA)—i.e., it does not require specifying the exact nature of the correlation between 
discretionary accruals and performance. Relative to performance-matching this analysis may 
result in more powerful tests of earnings management around earnings benchmarks as it 
does not introduce "noise" in the tests via the matching process. Accordingly, this method 
may reduce the likelihood of falsely accepting the null hypothesis of no earnings manage- 
ment due to low-power tests. 

This study's analyses should aid future researchers in investigating earnings manage- 
ment around earnings benchmarks using specific financial statement accounts (e.g., loan 
loss provision, valuation allowance, pension expense, etc.) or alternative abnormal accrual 
measures. In addition, this study’s analyses should prove useful in tests for cross-temporal 
shifts in earnings management activity (e.g., pre- and post-Sarbanes-Oxley) or of other 
earnings management hypotheses (e.g., smoothing or big bath) that may be hindered by 
the systematic association between discretionary accrual proxies and performance. Finally, 
the pseudo target analyses may be particularly informative for studies assessing the prop- 
erties of néwly developed abnormal accrual measures. 

This study’s analyses should be interpreted in light of the following limitations. First, 
we assume that firms have greater incentives to manage earnings around the actual earnings 
benchmarks. If this assumption is not descriptive, then tests for increased associations be- 
tween discretionary accruals and beating earnings benchmarks are not informative. Second, 
if firms manage earnings throughout the distributions of earnings, earnings changes, and 
unexpected earnings, then tests for increased associations between discretionary accruals 
and beating earnings benchmarks underestimate the association between discretionary ac- 
cruals and beating the actual benchmarks that is attributable to earnings management. Third, 
our analyses assume that the underlying association between discretionary accruals and 
performance (i.e., irrespective of earnings management) does not shift around the actual 
earnings benchmarks. If this assumption is not descriptive, then tests could be biased in 
favor of or against finding more pronounced associations around the actual earnings 
benchmarks. 
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ABSTRACT: Rotation of audit partners is one of the main policy Initiatives that has 
been implemented In many Jurisdictions around the world to deal with concerns about 
audit quality. The basis of any requirement limiting the tenure of audit partners Is that 
there Is a reduction in audit quality associated with long periods of tenure. Using data 
from Australla, where the audit partner can be Identified and for a period where partner 
rotatlon was not mandatory, we examine the association between audit quality and 
long audit partner tenure. The three measures of audit quality examined are the audl- 
tor's propensity to issue a going-concern audit opinion for distressed companies, the 
direction and amount of abnormal working capital accruals, and just beating (missing) 
earnings. benchmarks. For long tenure observations we find a lower propensity to issue 
a going-concem opinlon and some evidence of Just beating (missing) earnings bench- 
marks, consistent with deterloration in audit quality associated with long audit partner 
tenure. There Is no evidence of an assoclation of long audit tenure with abnormal 
working capital accruals. 


Keywords: audit partner tenure; audit quality; qualifications; earnings management. 
Data Availability: A// data has been extracted from publicly available sources. 


L INTRODUCTION 
policy of mandatory rotation of audit partners for a particular audit client is sug- 
gested as a means of improving audit quality and therefore increasing the quality 
of general-purpose financial statements. The benefits normally espoused are that 
the independence of the engagement partner is maintained or that a new perspective on the 
audit may result in the identification of issues that have been overlooked on previous audits. 
These benefits therefore rely on the belief that extended audit partner tenure is associated 
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with decreases in audit quality. This research study provides evidence on the extent to 
which long audit partner tenure is associated with decreases in audit quality. 

In countries (such as the United States) where a policy of partner rotation is already 
in place, it is impossible to undertake an empirical evaluation of whether this policy is 
beneficial, as it is impossible to identify the consequences of not having (and therefore the 
benefits associated with implementing) this policy. The basis for the policy, that extended 
audit partner tenure is associated with decreased audit quality, is best informed by under- 
taking the analysis in an environment and at a time where the policy is not mandatory and 
the audit partner can be identified. Australia provides such an environment. There has been 
a statutory requirement for the auditor of listed Australian companies to sign the audit 
report in their own name as well as in the name of the audit firm since the 1970s (section 
324 (10) of. Australian Corporations Act). In Australia there was neither a legal nor pro- 
fessional requirement to rotate the audit partner until 2001.1 

The basis for a policy of audit partner rotation is that there is a diminution in audit 
quality over the tenure of an audit partner. Such a policy assumes that a sufficiently high 
level of audit quality is achieved in the years up to recommended rotation. It is inferred 
that after a long period of tenure, continuing relationships may impact a partner’s indepen- 
dence and time may erode а partner's capacity for.critical appraisal. If these potential factors 
result in a reduction in audit quality or in audit failure, then they will impose a cost on 
society and the profession.” 

This study investigates if there is a diminution in audit quality associated with long 
partner tenure using three common measures of audit quality: the auditor’s propensity to 
issue a going-concern audit opinion; an examination of the signed and absolute amount of 
abnormal working capital accruals; and an analysis of the extent to which key earnings 
targets are just beaten (missed). 

We undertake a cross-sectional analysis where we trace the length of partner tenure for 
all listed companies in Australia for a specific year to identify any association with audit 
quality. The year we choose is 1995, before a rotation policy became mandatory (2001) 
and before it was voluntarily adopted by any accounting firms (1997, refer to footnote 1). 
This time period allows for the identification of long tenure situations and therefore facil- 
itates an examination of the theoretical underpinnings of an audit partner rotation policy; 
that there is a diminution in audit quality associated with long audit partner tenure. 

For long partner tenure observations we find that there is a lower propensity to issue 
a going-concern opinion. There is no evidence of an association of long partner tenure with 
either the signed or absolute amount of abnormal working capital accruals. For the third 
audit-quality measure we find that there is some evidence of just beating (missing) earnings 
benchmarks for long partner tenure observations. The findings with regard the auditor’s 
propensity to issue a going-concern audit opinion and the extent to which key earnings 
targets are just beaten (missed) are consistent with a deterioration in audit quality associated 
with long partner tenure. Sensitivity analyses demonstrate that identified deterioration in 
audit quality is specifically associated with non-Big 6 audit firms. 


! Discussions with each of the Big 6 firms in Australia reveal that none of these firms had a partner rotation 
policy before 1997. Between 1997 and 2001 Arthur Andersen had a policy of mandatory partner rotation after 
seven years. The other Big 6 firms were aware of the issue, and undertook partner rotation on a best practice 
but not mandatory basis. In 2001, the accounting bodies in Australia introduced a professional requirement to 
rotate the audit partner for all listed companies at least every seven years. 

? The auditing profession will collectively suffer loss with each audit failure through a reduction in the profession's 
credibility (Grant et al. 1996). 
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The paper is structured as follows. Section II outlines recent key policy initiatives 
regarding audit partner rotation in some of the major jurisdictions around the world. Section 
Ш outlines the literature review and development of theoretical arguments. This is followed 
by separate sections outlining the research methodology and the results for each of the 
three audit-quality measures. Section X concludes and outlines the implications of the 

_ findings. 


: П. RECENT POLICY DEVELOPMENTS REGARDING AUDIT PARTNER 
ROTATION IN MAJOR JURISDICTIONS 

There is considerable variation in international policy regarding periodic audit partner 
rotation. The issue continues to attract policy attention in many countries. The policy first 
emerged in:the U.S. in the 1970s, at which time the AICPA Practice Section mandated 
periodic audit partner rotation after seven years of tenure (this became a requirement for 
U.S. Securities and Exchange Commission registered clients). Recent corporate events led 
to a revision of this policy with the signing of the Sarbanes-Oxley Act of 2002 and there 

-is now a requirement that the lead audit partner and audit review partner be rotated every 
five years on public company engagements. 

In the United Kingdom, the Cadbury Committee (Committee on the Financial Aspects 
of Corporate Governance 1992) argued in favor of partner rotation and regulations were 
subsequently adopted that followed the U.S. approach of requiring rotation of the audit 
engagement partner after seven years for listed companies. In January 2003, the Department 
of Trade and Industry and the U.K. Treasury issued a final report on review of audit and 
accountancy regulations. The report recommended that the maximum period before rotation 
be reduced from seven іо five years for the lead audit partner. The report also welcomed 
the recommendation from the EU that the partner rotation requirement should extend to 
other key audit partners (such as those involved in second partner review). It recommended 
a maximum séven-year period before rotation of other key audit partners. 

‘More than 80 countries use the international auditing standards and Code of Ethics 
prepared by the International Federation of Accountants (IFAC) as the basis for their stan- 
dards. In 2001 IFAC recognized in their Code of Ethics that prolonged use of the same 
lead engagement partner on ап audit may create what they termed a "familiarity threat.” 
The code proposed that the lead engagement partner be rotated after a pre-defined period, 
normally no more than seven years, and that the rotating partner not resume the lead 
engagement partner role until an additional two years have elapsed. 

In Australia, the earliest references to partner rotation arose in a report issued by the 
professional accounting bodies (Australian Society of CPAs and The Institute of Chartered 
Accountants in Australia 1994). On the basis of concerns about audit quality they advo- 
cated, among other reforms, the rotation of audit engagement partners every seven years. 
This suggestion was not adopted by the profession. In 1996 an Australian Commonwealth 

-Government working party released a review of the requirements for the regulation of 
auditors, which recommended mandatory rotation of audit partners every seven years. Again 
this suggestion was not adopted, although the independence standard at the time, AUP32, 
was revised to require “consideration” of a rotation policy. In 2001 the Australian profes- 
sion’s independence standard was revised in line with changes to the IFAC Code of Ethics. 
It stated in paragraph 2.51 that the lead partner be rotated after a predefined period of no 
longer than seven years. In 2004, legislation was introduced that requires the audit partner 
of listed companies to be rotated after no more than five years. There was no empirical 
evidence provided to support the benefits of either the introduction of a policy of partner 
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rotation or a change in the conditions of this policy. As outlined earlier, the basis of any 
policy of audit partner rotation is that there is a diminution in the quality of the audit 
associated with long audit partner tenure. This study examines the existence of such an 
association. 


Ш. LITERATURE REVIEW AND DEVELOPMENT OF 
TESTABLE HYPOTHESIS 

While there are a number of recent studies on audit firm tenure (Ghosh and Moon 
2005; Myers et al. 2003; Davis et al. 2003; U.S. Government Accounting Office [GAO] 
2003; Geiger and Raghunandan 2002; Johnson et al. 2002), there is very little empirical 
examination of the relationship between audit partner tenure and audit quality. The only 
published paper to date to examine this issue is Chi and Huang (2005), who, in an exam- 
ination of the Taiwanese audit market, find evidence consistent with lower earnings quality 
in the early years of audit tenure (either firm or partner) and the later years of tenure (firm), 
where earnings quality is measured as the level of abnormal accruals. However, their results 
are contingent on the method used to calculate the level of abnormal accruals. 

The literature on audit firm tenure suggests deterioration in audit quality in the early 
years of tenure due to a loss of client-specific knowledge and expertise. DeAngelo (1981) 
identifies a "learning curve" that gives incumbent auditors a comparative quality advantage. 
Continuity on an audit is said to reduce audit risk due to a familiarity with the client's 
system and an understanding of risks associated with the client's business/industry envi- 
ronment (Financial Reporting Commission [Ryan Commission Report] 1992; AICPA 
[Cohen Commission Report] 1978). 

While there is some empirical evidence of a heightened risk of reduced audit quality 
during the first years of an audit firm's tenure (AICPA 1992; Johnson et al. 2002; St. Pierre 
and Anderson 1984) and higher audit quality with longer audit firm tenure (Myers et al. 
2003), the extent to which these relationships translate to audit partner tenure is not clear. 
The risk of audit failure in the years immediately after partner rotation is likely to be less 
severe than the risk in the first years of an audit firm's tenure due to other quality controls 
that an audit firm can establish to mitigate risks associated with a new partner's lack of 
familiarity (e.g., second partner review and standard new engagement familiarization pro- 
cedures for incoming partner) and the maintenance of knowledge and expertise within the 
firm (e.g., continuity of field staff, the carrying forward of working papers, partner famil- 
iarity with existing audit methodology and client databases). 


Association between Long Audit Partner Tenure and Audit Quality 

The two (related) primary arguments supporting a negative association between long 
audit partner tenure and audit quality are (1) erosion of independence that may arise with 
the development of personal relationships between an auditor and their client, and (2) 
deterioration in the audit partner's capacity to effect critical appraisal (described in the 
IFAC Code of Bthics as a familiarity threat). 

The development of personal relationships between client management and the audit 
partner may threaten the independence of the auditor. Mautz and Sharaf (1961, 231) state 
that the auditor “шив! be aware of the various pressures, some obvious some subtle, which 
tend to influence [their] attitude and thereby erode slowly but surely [their] independence." 
In most instances “‘the greatest threat to [their] independence is a slow, gradual, almost 
casual erosion of [their] honest disinterestedness" (Mautz and Sharaf 1961, 208). This threat 
to independence does not only occur at the time of reporting, but also has the potential to 


The Accounting Review, May 2006 


Audit Partner Tenure and Audit Quality 657 


impact judgments made during the audit (Dopuch et al. 2003; Bazerman et al. 1997). Thus, 
it is alleged that extended personal relationships to the extent of developing bonds of loyalty 
or emotive relationships will consciously or subconsciously impact the auditor’s indepen- 
dence and objectivity. Specific arguments as to the consequence of diminished independence 
with long partner tenure include the increased likelihood of the auditor yielding to the 
inevitable client pressure in an audit conflict situation and, at the extreme, the possibility 
that excessive familiarity might result in auditor/client collusion (McLaren 1958). 

It is also argued that long audit partner tenure can lead to a familiarity threat that can 
result in a restriction on the creative audit-testing approaches that are often seen in the 
early years of an audit engagement (AICPA 1978; Hoyle 1978; McLaren 1958). The audit 
program may become stale and routine as the auditor begins to anticipate the condition of 
the client’s systems and the presence of control procedures (Hoyle 1978). Shockley (1981, 
789) asserts “complacency, lack of innovation, less rigorous audit procedures and a devel- 
oped confidence in the client may arise after a long association with the client." There is 
a tendency to anticipate results instead of being alert to subtle and often surreptitious, 
although important, anomalies (Arrunada and Paz-Ares 1997). 

There are also counter-arguments that suggest that long audit partner association with 
a particular client can result in higher audit quality. These arguments highlight the higher 
audit costs associated with early periods of auditor tenure and the increase in client and 
industry knowledge gained over repeated audits (Myers et al. 2003). 

The increased prevalence of mandatory partner rotation policies shows that the regu- 
lators believe that the potential costs associated with long periods of partner tenure outweigh 
the potential benefits. The Cadbury Committee (1992) argues that partner rotation enhances 
the technical rigor of an audit by encouraging a "fresh viewpoint." The AICPA's early 
requirement to change the partner in charge of auditing a public company at least every 
seven years "was adopted for the specific purpose of periodically bringing a fresh per- 
spective to each audit" (AICPA 1992, 4). The IFAC Code of Ethics (2003, para. 8.151) 
outlines that “using the same lead engagement partner on ап audit over a prolonged period 
may create a familiarity threat." Partner rotation is therefore prescribed as a quality control 
procedure that overcomes the problems associated with long periods of partner tenure. 

The prior discussion of a potential association between audit partner's tenure and audit 
quality suggests that in the early year(s) of tenure the partner is most likely to be indepen- 
dent and bring to the audit a fresh perspective. While the incoming engagement partner is 
developing the necessary knowledge and expertise, there is a possible increased risk of 
failure to detect material misstatements. The audit firm will attempt to reduce this risk 
through quality control processes designed to ensure the new engagement partner gains (or 
has access to) adequate knowledge and expertise. After this initial period has passed and 
the partner has gained the necessary client or industry knowledge and expertise, there is a 
period where audit quality is expected to be at its maximum. The relatively small number 
of years of association is not expected to have eroded independence or the partner's capacity 
to undertake critical appraisal. The length of this period is unknown, but the recent devel- 
opments in policy initiatives outlined earlier in this study suggest that this period is com- 
monly viewed as either five or seven years. After this period, the literature suggests a 
reduction in audit quality associated with long audit partner tenure through the development 
of personal relationships and deteriorating capacity for critical appraisal (familiarity threat). 
This association is examined by the following hypothesis: 


H1: There is a negative association between audit quality and long audit partner tenure. 
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In the analyses undertaken in the paper, long audit partner tenure is defined as situations 
where the auditor has been a partner in charge for more than seven years (TENURE>7). 
This is consistent with the period of concern currently espoused by the IFAC Code of 
Ethics, and the period of partner tenure that was commonly used in 1995 in other jurisdic- 
tions as the basis for a partner rotation policy. We also anticipate that there may be an 
initial period of familiarization for the audit partner. Thus we include the first two years of 
partner tenure as a control variable (TENURES2). This approach means that the basis of 
comparison for the later partner tenure is the period where audit quality is expected to be 
at its maximum (for this paper, between years 3 to 7 of partner tenure). Other possible 
specifications of long partner tenure are considered as forms of sensitivity analyses where 
appropriate. This includes the use of a continuous measure for partner tenure, TENURE» 5, 
and to further accentuate the effect of long periods of tenure, tenure squared. 


IV. METHODOLOGY 
Data Collection 


The data collection for this study involves the review and analysis of published infor- 
mation for public companies listed on the Australian Stock Exchange (ASX) in 1995. We 
select 1995 as the year for analysis as this is before the introduction of firm, professional, 
or legal requirements to limit audit partner tenure in Australia. For all listed companies in 
1995 we trace back the partner tenure, as well as collecting other relevant information for 
the analyses of the audit-quality measures. The population of companies listed on the ASX 
at 30 June 1995 is 1,152 (Craswell 1996). Of these, 40 foreign companies are removed 
(the audit report is signed in an overseas jurisdiction and does not necessarily identify the 
audit partner) and 66 are removed due to missing data with respect to incomplete financial 
information or an inability to trace the inception of audit partner tenure. The 25 observations 
in the banking and insurance industry are also excluded as their total asset base and financial 
structure is not comparable to those of the other companies. This results in a final sample 
of 1,021 Australian-domicile companies. 


Research Method 


To identify whether there is a negative association between long audit partner tenure 
and audit quality, we use three common measures of audit quality.? First, the auditor's 
propensity to issue a going-concern opinion for distressed companies; second, the amount 
of abnormal working capital accruals; and third, the extent to which key earnings targets 
are just beaten (missed). These are all examined over short, medium, and long periods of 
audit partner tenure. In accordance with the predicted directional effect, we utilize one- 
tailed testing in evaluating the impact of audit partner tenure. The comparison of measures 
of audit quality for the medium (when audit quality is expected to be at its maximum) with 
the long tenure periods allows us to identify any diminution in audit quality associated with 
long tenure. The examination of multiple measures of audit quality allows us to build greater 
confidence in any observed relationship between audit quality and long partner tenure. The 
research methods associated with each of these measures are discussed with their results. 


3 Other measures considered are the number of audit failures as identified by legal action undertaken against the 
auditor, or companies placed in bankruptcy soon after the issue of an unmodified audit report. The very small 
number of legal actions and bankruptcies in Australia means that these proxies are not appropriate. 
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V. DESCRIPTIVE RESULTS AND DISCUSSION 

Descriptive statistics are reported in Table 1. Of the 1,021 Australian-listed companies, 
15 percent (n = 152) have greater than seven years of continuous partner tenure.* Forty- 
four percent of companies (n = 447) have less than two years of continuous partner tenure. 
The companies have been listed on average 12.81 years and have A$299 million of assets. 
The Big 6 audit firms audit 64 percent of these clients and 7 percent of the audit opinions 
are modified relating to the going-concern issue. An examination of the correlation matrix 
reveals that apart from the expected high correlations of PBANK with LEV (0.627) and 
CFFO (—0.532), there are no other correlations greater than 0.50. 


VI. ANALYSIS 1: ISSUING GOING-CONCERN AUDIT OPINIONS 
Method 
The first measure of audit quality examined in this Чийг is that of the auditor’s рго- 
pensity to issue a going-concern modified audit opinion. Although it is not easy to associate 


TABLE 1 
Descriptive Statistics 
n = 1,021 
Variables Меп Median Std. Deviation = Minimum Maximum 
TENURE>7 0.15 0 0.36 0 1 
TENURES2 0.44 0 0.50 0 1 
PBANK : -1.64 -2.36 4.08 —7.39 33.57 
OPINION 0.07 0 0.25 0 1 
SIZE ($ in millions) 299.08 22.80 1,642.28 01 30,287.00 
AGE 12.81 9.00 12.94 1 93 
LEV 0.36 0.32 0.34 00 2.15 
LLOSS 0.38 00 0.49 0 1 
AUDFIRM 0.64 1.00 0.480 0 1 
FEERATIO 0.33 0.31 0.25 .00 95 
CFFO —0.03 0.02 0.24 -1.62 0.51 
MINING 0.37 0 0.48 0 1 
Variable Definitions: 
TENURE>7 = шы а р а эе а aa and 0 
TENURES2 = 3 if audit partner i is engagement partner on a client company less than or equal to two years, and 
otherwise; 


OPINION = 1 if company receives a going-concern modified opinion, and 0 otherwise; 
PBANK = probability of bankruptcy as measured by adjusted Zmijewski score; 
SIZE = total assets of the company at financial year-end; 
AGE = number of years since the company was listed on the Australian Stock Exchange; 
LEV = total liabilities divided by total assets; 
LLOSS = 1 if company reported a loss for the previous year, and 0 otherwise; 
AUDFIRM = 1 if audited by Big 6 firm, and 0 otherwise; 
FEERATIO = ratio of nonaudit fees to total fees paid to the incumbent auditor; 
“СЕЕО = cash flow from operations over total assets at the end of the fiscal year; and 
MINING = 1 if mining industry, and 0 otherwise. 





* Twenty-eight percent (п = 286) have more than five years of continuous partner tenure. The average audit 
partner tenure is just over four years, ranging from 1—24 years. 
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any particular type of audit opinion with a level of audit quality, there is a well-established 
body of literature that aids the prediction of going-concern modified opinions (see for 
example, DeFond et al. 2002; Mutchler et al. 1997; Hopwood et al. 1989). DeFond et al. 
(2002) argue that in issuing a going-concern opinion, the auditor must objectively evaluate 
company performance and withstand client pressure to issue a clean opinion. This suggests 
a correlation between the issuing of going-concern opinions and auditor independence. 
Thus, if after controlling for other factors that influence the propensity to issue a going- 
concern modified audit opinion, the probability of issuing going-concern reports is inversely 
related to long audit partner tenure, this suggests audit quality is diminishing with long 
partner tenure. 

Consistent with DeFond et al. (2002) and on the basis that going-concern problems are 
more salient among financially distressed companies, we restrict our analysis to situations 
where the company reports negative earnings and/or operating cash flows for the current 
fiscal year. Also, consistent with the approach of DeFond et al. (2002), the following logistic 
regression model estimates the auditor's probability of issuing a going-concern modified 
opinion to a financially distressed client: 


OPINION = B, + B,PBANK + B,SIZE + ВАСЕ + B,LEV + B,CLEV 
+ B,RETURN + B,LLOSS + BINVESTMENTS + B,AUDFIRM 


+ B,,FEERATIO + B,,CFFO + B MINING + 8,,TENURES2 
+ B,,TENURE>7 + е (D 


where: 

Dependent Variable: 

OPINION = 1 if company receives a going-concern modified opinion, and 0 otherwise; 
Experimental Variable: 


TENURE>7 = 1 if audit partner is engagement partner on a client company for greater 
than seven years, and O otherwise; 


Control Variables: 


PBANK = probability of bankruptcy as measured by adjusted Zmijewski score; 
SIZE = natural logarithm of total assets of the company at financial year-end; 
AGE = natural logarithm of number of years since listing on the Australian 

Stock Exchange; 
LEV = total liabilities divided by total assets; 
CLEV = change in LEV during the year; 
RETURN - market-adjusted return over the fiscal year; 


5 This measure is used in many studies (e.g., Carcello et al. 1995) and incorporates the financial variables found 
to be significant in bankruptcy prediction studies (e.g., Hopwood et al. 1989). This is based on the form of the 
model calculated by Carcello et al. (1995) who use the weightings b* = —4.803 —3.6(net income/total assets) 
+ 5.4(total debt/total assets) — 0.1(current assets/current liabilities). There are no significant differences in the 
results when the original form of the bankruptcy model as developed by Zmijewski (1984) is used. 
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LLOSS = 1 if client reported a loss for the previous year, and 0 otherwise; 
INVESTMENTS = short- and long-term investment securities (measured as current assets 
less debtors and inventory) divided by total assets; 

AUDFIRM = 1 if audited by Big 6 firm, and 0 otherwise; 
FEERATIO = ratio of nonaudit fees to total fees paid to the incumbent auditor; 
CFFO = operating cash flow divided by total assets; 
MINING = 1 if mining industry, and 0 otherwise; and 
TENURES2 = 1 if audit partner is engagement partner on a client company for two 
years or less, and 0 otherwise. 


The choice of control variables in the model is informed by the literature that aids the 
prediction of a going-concern opinion (in particular, DeFond et al. 2002). As outlined 
earlier, TENURES2 is used to control for instances of possible lower audit quality in the 
first two years of partner tenure due to a lack of client familiarity affecting auditor expertise. 
The other variables are consistent with the model of DeFond et al. (2002).5 PBANK is the 
probability of bankruptcy score based on Zmijewski (1984), with higher values indicating 
a higher probability of bankruptcy. SIZE is included because large companies have greater 
negotiating power and are less likely to end up in bankruptcy. AGE captures the fact that 
younger companies are more likely to encounter financial distress. LEV captures risk as- 
sociated with higher levels of debt, and CLEV captures movements in leverage that may 
move companies to unacceptable or unsustainable levels of debt. RETURN is a market- 
based measure of risk and company performance. LLOSS is included as companies with 
continued losses are more likely to fail. INVESTMENTS is a liquidity measure that captures 
the ability to quickly raise cash, while AUDFIRM captures any differences in the propensity 
of the big audit firms to issue going-concern modifications to the audit opinion. It is sug- 
gested that FEERATIO is associated with reduced audit quality (Frankel et al. 2002) and 
poor operating cash flows (CFFO) are commonly associated with a higher likelihood of 
financial distress (DeFond et al. 2002). The variable MINING is included because of the 
large number of mining companies listed in Australia and their potentially different financial 
profiles (Butterworth and Houghton 1995). 


Results 


The restriction of our analysis to situations where the client reports negative earnings 
and/or operating cash flows in 1995 results in the identification of 559 companies. The 
average partner tenure of these observations is 3.75 years (46 percent are in either years 1 
or 2 of partner tenure and 11 percent are long tenure observations of greater than seven 
years) and we observe 66 (12 percent) going-concern modified audit opinions. 

Table 2 presents results from the logistic regression model for going-concern modi- 
fied audit opinions. The model adequately distinguishes the going concern decision (x? 
= 114.15, df = 14, p = .000), and an overall fit of 88.6 percent is achieved. Consistent 
with DeFond et al. (2002), we find significance in the predicted direction at p « .05 for 
the coefficients LLOSS and INVESTMENTS, while SIZE, CLEV, RETURN, CFFO, and 
FEERATIO are not found to be significant predictors. However, in contrast to DeFond et 
al. (2002), we find significance in the predicted direction for LEV and opposite to the 
predicted direction for AGE. We do not find significance in the predicted direction of the 


5 Variables used by DeFond et al. (2002) but not included in the present study are the market variables of beta 
risk and return volatility and an indicator variable for ability to raise future financing. These variables are not 
available for many of our observations and are therefore not considered. 
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. TABLE 2 
Propensity to Issue a Going-Concern Opinion for Financially Distressed Companies 
(n = 559) 


OPINION = В, + В.РВАМК + B,SIZE + ВАСЕ + B4LEV + B,CLEV + B,RETURN 
+ B,LLOSS + BINVESTMENTS + B,AUDFIRM + B,,FEERATIO 
+ B,,CFFO + B,,MINING + B,,TENURES2 + B,,TENURE>7 + є. 


Expected Sign Coefficient (p-value) 
Experimental Variable 
TENURE>7 - —1.517 (.042)** 
Control Variables 
TENURES2 - 487 (.072)* 
РВАМК + 035 (325) 
SIZE - —.061 (591) 
АСЕ - 453 (.022)** 
LEV + 1.809 (.000)** 
CLEV + —.477 (337) 
RETURN - -.243 (298) 
LLOSS + 1.748 (.000)** 
INVESTMENTS - 22411 (.005)** 
AUDFIRM + —.630 (.053)* 
FEERATIO - —.362 (611) 
CFFO - 3187 (790) 
MINING ? —.405 (.236) 
Constant —3.063 (.108) 


Pseudo R? = 0.358 
Model Chi-square = 114.15, p < 0.00 


*, ** Significant at the 10 percent and 5 percent levels, respectively. 
Variable Definitions: 
OPINION = 1 if company receives a going-concern modified opinion, and 0 otherwise; 
TENURE>7 = 1 if audit partner is engagement partner on a client company more than seven years, and 0 


otherwise 
TENURES2 = 1 if andit partner is engagement partner on a client company less than or equal to two years, 
and 0 otherwise; 
PBANK = probability of bankruptcy as measured by adjusted Zmijewski score; 
SIZE = the natural logarithm of total assets of the company at financial year-end; 
AGE = the natural logarithm of the number of years since the company was listed; 
LEV = total liabilities divided by total assets; 
CLEV = change in LEV during the year, 
RETURN = market-adjusted return over the fiscal year; 
LLOSS = 1 if company reported a loss for the previous year, and 0 otherwise; 
INVESTMENTS = short- and long-term investment securities (measured as current assets less debtors and 
inventory) divided by total assets; 
AUDFIRM = 1 if audited by Big 6 firm, and 0 otherwise; 
FEERATIO = ratio of nonaudit fees to total fees paid to the incumbent auditor; 
CFFO = fk ee ee ставова ае 
MINING = 1 if mining industry, and 0 otherwise. 


coefficients for AUDFIRM and PBANK. The estimated coefficient for TENURE=2 is pos- 
itive but is also not significant (р = .072, one-tailed). 

The negative estimated coefficient for TENURE>7 is significant (р = .042, one-tailed) 
and in the expected direction. This result suggests that long-serving audit partners have a 
lower propensity to issue a going-concern modified opinion. This provides evidence that is 
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consistent with H1, that long audit partner tenure is negatively associated with audit quality. 
To determine the economic significance of this result, we change the value for TENURE>7 
from 0 to 1 while setting other variables in model (1) equal to their sample mean. This 
results in a 5.30 percent decrease in the probability of receiving a going-concern modified 
opinion. 


Sensitivity Analyses 

To control for the impact of a prior year going-concem opinion on an auditor’s pro- 
pensity to issue a going-concern opinion in the current year, we first replicate the primary 
logistic regression analysis, substituting first-time going-concern opinions as the dependent 
variable, as in DeFond et al. (2002). Of the 66 going-concern modifications. there are 33 
first-time going-concern opinions. The model is a significant predictor of first-time going- 
concern modifications (x? = 49.54, df = 14, р = .000) and results are broadly consistent 
with the analysis containing all going-concern modifications. The TENURE» 7 variable has 
a negative coefficient but its significance is now marginal (p — .075, one-tailed). Second, 
we rerun the primary analysis including a dummy variable for whether the company re- 
ceives à prior-year going-concern modification. Similar results to those of the analysis of 
the first-time going-concern opinions are obtained, with TENURE>7 having a negative 
coefficient and again being marginally significant (p — .098, one-tailed). 

Identifying that not all loss making or cash flow negative firms will receive a going- 
concern modified opinion (only 66 of the 559 companies received going-concern modifi- 
cations), we extend this analysis to consider those companies most likely to receive a going- 
concern modified opinion. We therefore examine the most severely distressed quartile (140 
companies) of the financially stressed companies. The going-concern modified opinion is 
issued in 17 of 43 (39.5 percent) instances for short tenure, 27 of 80 (33.8 percent) instances 
for medium tenure, and 2 of 17 (11.8 percent) instances for long tenure (greater than seven 
years). Using a test of proportions, we identify that there is no significant difference between 
the proportions for short and medium tenure (z = 0.64, p > .05), while the difference 
between medium and long tenure is significant (z = 1.72, p = .043, one-tailed). This 
provides evidence of a lower propensity to modify the audit opinion for the most severely 
financially stressed companies associated with long audit partner tenure. 

To further investigate if the auditor's propensity to issue a modified audit report di- 
minishes with tenure, we substitute the two tenure variables (TENURES2, TENURE>7) 
with a single linear TENURE variable, while controlling for instances of possible lower 
audit quality in the first two years of partner tenure by excluding the 258 TENURES2 
observations. TENURE remains negative and significant (р = .012, one-tailed),° indicative 
of a lower propensity to issue a going-concern modification during long partner tenure. 

To explore the potential interaction between AUDFIRM and long partner tenure, we 
include an interaction term AUDFIRM*TENURE in model (1). TENURE is measured as a 
continuous variable!? and observations where tenure is less than two years are excluded. 
The inclusion of the interaction term results in TENURE remaining significant and negative 
(B = —.509, p = .005) while the interaction term is significant and positive (B = .544, 





? The 33 ongoing going-concern modification observations are excluded from the analysis. 

* Those companies with the greatest possibility of bankruptcy are identified by applying model (1). This quartile 
received 46 of the 66 (69.7 percent) going-concern modified audit opinions. 

? When we square the tenure variable to accentuate the effect of partner tenure in the full sample of 559 financially 
distressed companies, the coefficient remains negative and highly significant (p — .003, one-tailed). 

10 We use the continuous TENURE measure because the small number of going-concern modifications for TENURE 
> 7 observations results in a large error term and unreliable results. 
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p = .021). This analysis shows that the significant finding for TENURE identified in the 
earlier analysis is due to non-Big 6 firms having a lower propensity to issue a going-concern 
modified opinion than Big 6 firms in the later years of partner tenure.!! 


VII. ANALYSIS 2: ABNORMAL WORKING CAPITAL ACCRUALS 

Method 

Consistent with prior literature, we posit that higher-quality audits mitigate extreme 
management reporting decisions. Accruals are commonly used to. identify these extreme 
reporting decisions (Myers et al. 2003; Francis et al. 1999; Becker et al. 1998). We con- 
centrate on abnormal working capital accruals (AWCA) and utilize the proxy measure of 
AWCA developed by DeFond and Park (2001). AWCA is the difference between realized 
Working capital and an expected level of working capital needed to support a current sales 
level, where an historic relation of working capital to sales captures expected working 
capital. In their analysis, DeFond and Park (2001, 399) find their measure to be a more 
powerful test in comparison to a test using total (normal plus abnormal) accruals. The focus 
on working capital accruals is further supported by prior research that suggests that man- 
agement has the most discretion over such accruals (Ashbaugh et al. 2003; Becker et al. 
1998). 

The measure of AWCA is: 


AWCA, = WC, — [WC,.,/5, ) X SJ 
where: 


t = year, t—1 refers to the prior year; 
WC, — non-cash working capital in the current year computed as (current assets — cash 
and short-term investments) — (current liabilities — short-term debt); 
WC, , = non-cash working capital in the prior year; 
5, = sales in current year; and 
$,., = sales in prior year. 


We scale all variables by average total assets (Myers et al. 2003). 

We examine the signed and absolute amounts of AWCA, which are indicators of the 
degree to which management is allowed to exercise discretion in reporting earnings. If 
reduced audit quality occurs in the later years of audit partner tenure, then we expect larger 
(income-increasing) signed and higher unsigned (absolute) amounts of AWCA in these later 
years (Myers et al. 2003; Frankel et àl. 2002; Reynolds and Francis 2000; Francis et al. 
1999; Becker et al. 1998). 

We estimate the following model: 


п For the Big 6 sample the qualification rate is 8 of 133 observations (6.0 percent) for medium tenure compared 
with 2 of 34 observations (5.9 percent) for long tenure. For the non-Big 6 sample tbe qualification rate is 24 of 
106 observations (22.6 percent) for medium tenure and 0 of 28 observations (0 percent) for long tenure. 
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AWCA = В, + B,AUDFIRM + B,PBANK + BOPINION + B,SIZE + B,LEV 
+ B4LLOSS + B,FEERATIO + В,РЕКРОКМ + B,AGE 
+ B,.GROWTH + 8,,CFFO + B MINING + B,,TENURES2 
+ B,,TENURE>7 + € (2) 


where: 

Dependent Variable: 

AWCA = abnormal working capital accruals (measured in signed or absolute amount); 
Additional Control Variables: 


PERFORM” = earnings before tax over total assets at the end of the fiscal year; and 
GROWTH = change in assets from prior year. 


The choice of control variables is informed by prior studies that examine aspects of 
audit quality by measuring the level of abnormal accruals, in particular those conducted by 
Myers et al. (2003), Johnson et al. (2002), and Frankel et al. (2002). A number of the 
variables (PBANK, SIZE, LEV, LLOSS, AUDFIRM, OPINION, FEERATIO, AGE, CFFO, 
MINING) contained in the going-concern model outlined in the previous section are ex- 
pected to be associated with both going-concern prediction and the level of abnormal ac- 
cruals. Two of the control variables in the going-concern model (INVESTMENTS and 
CLEV), are not used in abnormal accrual studies and, thus, are not included in this study. 
Two additional variables are included. As abnormal accrual models do not completely 
extract nondiscretionary accruals that are correlated with company performance (e.g., 
Dechow et al. 1995; Ashbaugh et al. 2003), we include a measure of performance to control 
for the nondiscretionary component of AWCA not extracted by the model (PERFORM). As 
accruals are likely to be associated with a company’s growth opportunities (Johnson et al. 
2002), GROWTH is controlled for using the change in assets from the prior year. 


Results 


The sample for which AWCA are calculated is reduced from 1,021 to 743 companies. 
Fifty-six companies are deleted because the required prior-year information is not available 
(newly listed companies), and 222 because the company does not report a consistent pattern 
of sales (revenue) between the current and previous fiscal years.'? 

Table 3 presents the results of estimating a model examining the relationship between 
AWCA (as an indicator of earnings management) and audit partner tenure. With respect to 
the control variables, CFFO is negatively associated with AWCA (consistent with Myers et 
al. [2003] and Johnson et al. [2002]). Riskier companies and more highly levered companies 
are also found to have higher levels of signed AWCAs. There is some support that signed 


12 When we substitute PERFORM with the absolute value of PERFORM, there is no significant differences in the 


results. 
із Unstable sales revenue is determined as zero revenue in any year, or less than 10 percent of the comparison 
year's sale revenue. 
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TABLE 3 
OLS Regression Results for Abnormal Working Capital Accruals 
(п = 743) 


AWCA = B, + BjAUDFIRM + В,РВАМК + B,OPINION + B,SIZE + B4LEV + BgLLOSS 
+ B,FEERATIO + B,PERFORM + B,AGE + B GROWTH + B,,CFFO 
+ B,,MINING + B,,TENURES2 + B,TENURE»7 + e. 


Signed AWCA Absolute AWCA 


Expected a eee Еды ыыы ЈН 
Variables Sign Coefficient t-value Coefficient t-value 
Experimental Variable 
TENURE>7 + —.017 (—.247) —.027 (—.454) 
Control Variables 
TENURES2 + .087 (1.648)* .030 (.658) 
PBANK + —.118 (—2.333)** —.002 (—.049) 
OPINION + 148 (1.207) 112 (1.063) 
SIZE - 012 (853) -- 028 (—2.274)** 
LEV + 604 '  (2.002)** .129 (.497) 
LLOSS + 047 (.757) 14 (2.672)** 
AUDFIRM = 072 (1.352) .001 (.017) 
FEERATIO - —.193 (—1.842)* .108 (1.202) 
PERFORM 7 —.304 (-1.113) 173 (.735) 
АСЕ Е .031 (1.099) .027 (1.109) 
GROWTH + -.004 (—.323) —.012 (—1.004) 
CFFO = —.324 (-2.023)** -.305 (—-2.212)** 
MINING ? - 034 (–.595) ` .036 732 
Constant —.783 (—2.348)** 481 (1.672)* 
Adj. R? 012 040 


*, ** Significant at the 10 percent and 5 percent levels, respectively. 
Variable Definitions: 
AWCA = abnormal working capital accruals (measured in signed or absolute amount); 
TENURE>7 = 1 if audit partner is engagement partner on a client company more than seven years, and 0 
otherwise; 


TENURES2 = 1 if audit partner is engagement partner on a client company less than or equal to two years, 
and 0 otherwise; 
PBANK = probability of bankruptcy as measured by adjusted Zmijewski score; 
OPINION = 1 if company receives a going-concem modified opinion, and 0 otherwise; 
SIZE = the natural logarithm of total assets of the company at financial year-end; 
LEV = total liabilities divided by total assets; 
LLOSS = 1 if company reported a loss for the previous year, and 0 otherwise; 
AUDFIRM = 1 if audited by Big 6 firm, and 0 otherwise; 
FEERATIO = ratio of nonaudit fees to total fees paid to the incumbent auditor; 
PERFORM - earnings before tax over total assets at the end of the fiscal year; 
AGE - the natural logarithm of the number of years since the company was listed; 
GROWTH - change in assets from prior year; 
CFFO - cash flow from operations over total assets at the end of the fiscal year; and 
MINING = 1 if mining industry, and 0 otherwise. 


AWCAs are higher in the earlier years of partner tenure. Smaller companies and those with 
a prior year loss have higher absolute levels of AWCA. 

If the level of earnings management increases as audit partner tenure increases, then 
we expect higher levels of signed and absolute accruals during the later years of partner 
tenure. For both the measures of AWCA (signed and absolute), the models are significant 
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and the amount of explained variation ranges from 1.2 percent to 4.0 percent. For both 
models, TENURE>7 is not significant, suggesting that earnings management as measured 
by AWCA is not associated with long partner tenure. 

Because of asymmetric relationships between the independent variables and positive 
and negative abnormal accruals, income-increasing and income-decreasing abnormal accru- 
als are analyzed separately (Menon and Williams 2004;. Myers et al. 2003). Accordingly 
we separate the full sample into companies with positive (income-increasing) and negative 
(income-decreasing) AWCA and estimate the regression models separately on these two 
samples.!^ For the reduced sample of income-increasing AWCAs (п = 398), the explanatory 
power of the model improves (adjusted В? = .048) and TENURE? 7 is not significant (t 
= —.069, p > .05). For the sample of income-decreasing AWCAs (n = 345), the explana- 
tory power of the model is lower (adjusted R? = .022) and TENURE 7 is not significant 
(t = —.150, p > .05).5 The results therefore suggest no deterioration in audit quality 
associated with long partner tenure, for either income-increasing or -decreasing AWCA. 

Overall, our results do not find that long audit partner tenure is associated with either 
a deterioration in audit quality (as hypothesized), or an improvement in audit quality (as 
suggested by Myers et al. [2003] when examining audit firm tenure), when audit quality is 
measured by its association with the signed or absolute amounts of AWCA. 


Sensitivity Analyses 

The fact that TENURE>7 is not significant may be caused by the small number of 
long tenure observations. Substituting the experimental variables with a binary variable 
measuring tenure greater than five years (TENURE 5) again finds no evidence that long 
partner tenure is associated with reduced audit quality. 

As income-increasing AWCA are associated with lower audit quality (Menon and 
Williams 2004; Myers et al. 2003; DeFond and Park 2001; Becker et al. 1998; DeFond 
and Jiambalvo 1993), we also examine the sign (income-increasing versus income- 
decreasing) of AWCA. The logistic regression model is significant, and the amount of ex- 
plained variation is 5.7 percent. Neither TENURE» 7 or TENURE-2 are significant. This 
suggests that earnings management as measured by the sign of AWCA is not associated 
with partner tenure. 

To further investigate whether audit partner tenure is associated with AWCA, we replace 
the TENURES2 and TENURE>7 variables with a single linear TENURE variable and 
control for instances of possible lower audit quality in the first two years of partner tenure 
by excluding the 296 TENURES2 observations. Again, no evidence is found to suggest 
that long periods of partner tenure are associated with reduced audit quality.!é 

In the analyses undertaken, AUDFIRM is not significant despite a good a priori reason 
to believe that long tenure plays out differently for larger audit firms (as previously dis- 
cussed under sensitivity analysis for Section VI). To further explore the potential interaction 
between long partner tenure and AUDFIRM, we use the same approach as that used in 
Section VI. This involves excluding the first and second year of tenure observations and 
including TENURE as a continuous variable and AUDFIRM*TENURE as an interaction 
term in model (2). Both TENURE and the interaction term are insignificant. This suggests 


14 As the samples are truncated at one end of the distribution for both regressions, we use a maximum likelihood 
truncated regression procedure to obtain corrected coefficients. 

15 TENURE 2 із not significant in either analysis. 

16 When we square the tenure variable to accentuate the effect of partner tenure, the coefficient remains 
nonsignificant. 
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that there are no differences in the signed or absolute level of accruals over different levels 
of tenure for Big 6 versus non-Big 6 audit firms. 

As part of the sensitivity analysis, we also run a variation of the definition of AWCA, 
utilizing the measure of current accruals used by Myers et al. (2003).!” This is: 


Curr_Accruals = ((ACurr_Ass — ACash) — (ACurr_Liab — ASTD)) 
where: 


ACurr_Ass = change in current assets; 
ACash = change in cash equivalents; 
ACurr_Liab = change in current liabilities; and 
ASTD = change in short-term notes and current portion of long-term debt. 


Using this measure of current accruals (either signed or absolute) as the dependent variable 
for model (2) does not change the insignificance of the TENURE>7 variable. 


УШ. ANALYSIS 3: JUST BEATING OR MISSING EARNINGS BENCHMARKS 
Method 


For this analysis, the quality of audited financial information is measured by the extent 
of earnings management. This is evidenced by whether the earnings just beats or misses 
specific benchmarks, and the extent to which this event is associated with auditor tenure. 
Benchmark studies find (1) companies avoid reporting a loss; (2) an unusually low fre- 
quency of small decreases in earnings and unusually high frequency of small increases in 
earnings;!? and (3) an unusually high frequency of companies meeting or exceeding ana- 
lysts’ forecasts (Menon and Williams 2004; Holland and Ramsay 2003; Matsumoto 2002; 
Degeorge et al. 1999; Burgstahler and Dichev 1997; Hayn 1995). We utilize the first two 
measures?’ in order to identify the association between earnings management and partner 
tenure. The formulae are: 


(a) Avoid reporting a loss = Earnings, Total Assets; and 
(b) Exceed prior year's earnings = (Earnings, — Earnings, ,)/Total Assets, ,. 


We utilize the model used in Menon and Williams (2004) on meeting or missing bench- 
marks to investigate the relationship between audit partner tenure and audit quality. We 
therefore develop the following multivariate logistic regression models: 


Y = В, + B,AUDFIRM + B;PBANK + B,SIZE + В.АСЕ + B,MKTVAL 
+ B,MINING + B,TENURE<2 + B,TENURE»7 + є (3) 


17 This measure has the advantage that it is possible to retain the 222 companies that do not report a consistent 
pattern of sales between the current and previous fiscal years. (The analysis reported here does not include these 
companies, but they are included for the sensitivity analysis contained in Section IX.) The measure however 
has the disadvantage of not relating these accruals to a level of activity for the company, and therefore fails to 
identify abnormal working capital accruals for that company. 

18 DeAngelo (1986) finds evidence of strong market incentives to maintain consistent earnings increases, and Barth 
et al. (1999) observe that managers focus on maintaining steadily increasing earnings. In a field study, Nelson 
et al. (2002) also provide evidence that managers are more likely to make attempts that increase current year’s 
income. 

19 The availability of analysts’ forecasts is limited to only the very large companies in Australia in 1995, resulting 
in insufficient degrees of freedom to undertake this analysis. 
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where Y is defined in four different models as Pr(BEATS_BE = 1), Pr(MISSES_BE = 1), 
Pr(BEATS_LYR = 1), and Pr(MISSES..LYR = 1). 


Additional Control Variable: 
MKTVAL = the natural logarithm of the market value of equity. 


Using a similar method of analysis to Menon and Williams (2004), there are two 
regression analyses for the first benchmark (company performance around breakeven). In 
the first regression analysis the dependent variable BEATS. BE = 1 when profit is less than 
2 percent of total assets, and 0 otherwise. In the second regression analysis the dependent 
variable MISSES. BE = 1 when loss is less than 2 percent of total assets, and 0 otherwise. 
For the second benchmark (last year’s profit), the dependent variable BEATS_LYR = 1 
when the increase in profit (decrease in loss) over last year's profit/loss is less than 2 
percent of total assets, and 0 otherwise. We then undertake the analysis where the dependent 
variable MISSES_LYR = 1 when decrease in profit (increase in loss) over last year's profit/ 
loss is less than 2 percent of total assets, and 0 otherwise. If clients with long tenure 
partners are granted more discretion than other clients, then we expect В, to be positive 
and significant for the analyses of the just beats the benchmarks (breakeven and last year's 
profit) and negative and significant for the analyses of the just misses the benchmarks. 

To determine whether the just beats or misses specific earnings benchmarks are asso- 
ciated with long partner tenure (TENURE 7), we control for the age of the client (AGE) 
and variations in the size of clients by using both the natural logarithm of total assets (SIZE) 
and market value of equity (MKTVAL) (as in Menon and Williams 2004). In addition to 
TENURE-2 we include three additional variables not in Menon and Williams (2004). As 
previously discussed, audit firm type is expected to constrain earnings management and is 
controlled for by including the variable AUDFIRM. PBANK, which is measured by an 
adjusted Zmijewski score, is also expected to be associated with earnings management. To 
control for the high proportion of mining companies in Australia we include the variable 
MINING. The analysis is performed on the population of 1,021 company observations. 

Table 4, Panel A shows the frequency with which short, medium, and long partner 
tenure are associated with just beats versus just misses breakeven. For short partner tenure, 
the proportion of the combined just beats and just misses observations that do just beat 
breakeven is 26 of 43 (60.5 percent). For medium tenure, this is 22 of 45 (48.9 percent) 
and for long tenure, this is 11 of 15 (73.3 percent). The proportion of long tenure versus 
medium tenure observations that just beats breakeven versus just misses breakeven is sig- 
nificant (z — 1.65, p — .05, one-tailed). This is consistent with the view that there is a 
lower audit quality associated with long audit partner tenure. 

Separate regression analyses are calculated for the two dependent variables; just misses 
breakeven and just beats breakeven. The results of these are contained in Table 4, Panel B. 
Consistent with the descriptive results above, TENURE 7 is positive but not significant for 
the just beats breakeven analysis, and negative and approaching significance for the just 
misses breakeven. (TENURES2 is also negative and approaching significance for the 
just misses breakeven.) 

In the second analysis, which examines the just beats (just misses) last year's profit 
(by less than 2 percent of total assets), the 56 first-year listing observations are deleted, 
leaving 965 observations in the analysis. Table 5, Panel A shows the frequency with which 
short, medium, and long partner tenure are associated with just beats versus just misses 
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TABLE 4 
Just Beats and Just Misses Breakeven 

(п = 1,021) 

Panel A: Earnings Benchmark Frequencies and Statistics 
ТЕМОКЕ =2 Medium Tenure TENURE>7 Total 

МРТА < —2% 196 (43.9%) 178 (42.2%) 48 (31.6%) 422 
NPTA —2 to 0% 
(just misses) 17 (3.8%) 23 (5.5%) 4 (2.6%) 44 
NPTA 0 to 2% 
Gust beats) 26 (5.8%) 22 (5.2%) 11 (7.2%)t 59 
NPTA > 2% 208 (46.5%) 199 (47.1%) 89 (58.6%) 496 
Total 447 (100%) 422 (100%) 152 (100%) 1,021 


Panel B: Logistic Regression Results 


Pr(BEATS_BE or MISSES_BE = 1) = В, + B,AUDFIRM + В,РВАМК + B,SIZE 
+ ВАСЕ + B,MKTVAL + B,MINING 
+ B,TENURES2 + B,TENURE»7 + в. 


Just Beats Breakeven Just Misses Breakeven 
Exp. Sign Coefficient Sig. Exp. Sign Coefficient Sig. 


Experimental Variable 
TENURE > 7 + 445 .134 = —.778 .085* 
Control Variables 
TENURE 2 + —.147 321 - —.514 .073* 
PBANK + - 47 212 = —.058 .086* 
SIZE - 851 .000** t 225 240 
AUDFIRM = .224 480 + —.045 .893 
AGE = —.355 .017** + —.137 428 
MKTVAL + —.730 .000** - —.256 4169 
MINING ? —.112 Л44 ? 555 102 
Constant —4.857 .001** —2.416 122 
Adj. В? (Pseudo В?) 094 032 


f Significant difference from medium tenure at the 5 percent level (one-tailed) for the proportion of just beats to 
just misses. 
, ** Significant at the 10 percent and 5 percent levels, respectively. 
Variable Definitions: 
NPTA = net profit to total assets; 
BEATS_BE = 1 if profit is less than two percent of total assets, and 0 otherwise; 
MISSES_BE = | if where loss is less than 2 percent of total assets, and 0 otherwise; 
ТЕМОКЕ>7 = 1 if audit partner is engagement partner on a client company more than seven years, and 0 


otherwise; 
TENURES2 = 1 if audit partner is engagement partner on a client company less than or equal to two years, 
and 0 otherwise; 
PBANK = probability of bankruptcy as measured by adjusted Zmijewski score; 
SIZE = the natural logarithm of total assets of the company at financial year—end; 
AUDFIRM = 1 if audited by Big 6 firm, and 0 otherwise; 
AGE = the natural logarithm of the number of years since the company was listed; 
MKTVAL = the natural logarithm of the market value of equity; and 
MINING = 1 if mining industry, and 0 otherwise. 
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ТАВГЕ 5 
Just Beats and Just Misses Last Year’s Profit 
(п = 965) 
Panel A: Earnings Benchmark Frequencies and Statistics 
TENURES2 Medium Tenure | TENURE-7 ` Тош 
ANPTA < —2% 142 (36.3%) 149 (35.3%) 45 (29.6%) 336 
ANPTA —2 to 0% 
(just misses) 47 (12.096) 49 (11.6%) 15 (9.9%) 111 
ANPTA 0 to 2% из 
(just beats) 59 (15.1%)1 94 (22.3%) 37 (24.3%) 190 
ANPTA > 2% 143 (36.6%) 130 (30.8%) 55 (36.2%) 328 
Total 391 (100%) 422 (100%) 152 (100%) 965 


Panel B: Logistic Regression Results 


Pr(BEATS_LYR or MISSES..LYR = 1) = В, + B,AUDFIRM + В.РВАМК + p4SIZE 
+ B,AGE + BMKTVAL + B,MINING 
+ В. ТЕМОКЕ=2 + B,TENURE>7 + є. 
Just Beats Last Year Just Misses Last Year 
Exp. Sign Coefficient Sig. Exp. Sign Coefficient Sig. 


Experimental Variable 
TENURE > 7 + —.111 329 = — 302 175 
Control Variables 
TENURE = 2 + —.337 .047** — .093 342 
РВАМК + —.034 ‚165 Е —.034 .238 
SIZE — .188 .106 + 512 .000** 
AUDFIRM - —.276 .172 t —.023 .920 
AGE = .096 .345 + .041 731 
MKTVAL T 201 .078* = —.338 .006** 
MINING ? — 1.003 .000** .162 .496 
Constant —7.853 .000** —5.415 .000** 
Adj. R? (Pseudo В?) 211 .051 
` t Significant difference from medium tenure at the 5 percent level (one-tailed) for the proportion of just beats to 
just misses. 
`*, ** Significant at the 10 percent and 5 percent levels, respectively. 
Variable Definitions: 


ANPTA = change from last year in net profit to total assets; 

ВЕАТУ_ГУК = 1 if increase over last year's earnings is less than 2 percent of total assets, and 0 otherwise; 
MISSES_LYR = 1 if decrease from last year’s earnings is less than 2 percent of total assets, and 0 otherwise; 
TENURE>7 = 1 if audit partner is engagement partner on a client company more than seven years, and 0 

otherwise; 
TENURE-2 = 1 if audit partner is engagement partner on a client company less than or equal to two years, 
and 0 otherwise; 
PBANK = probability of bankruptcy as measured by adjusted Zmijewski score; 
SIZE = the natural logarithm of total assets of the company at financial year-end; 
AUDFIRM = 1 if audited by Big 6 firm, and 0 otherwise; 
AGE = the natural logarithm of the number of years since the company was listed; 
MKTVAL = the natural logarithm of the market value of equity; and 
MINING = 1 if mining industry, and 0 otherwise. 
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last year’s profit figure. For short partner tenure the proportion of the combined just beats 
and just misses observations that do just beat last year’s profit is 59 of 106 (55.7 percent). 
For medium tenure this is 94 of 143 (65.7 percent), and for long tenure this is 37 of 52 
(71.2 percent). The proportion of long tenure versus medium tenure observations is not 
significant, suggesting no diminution in audit quality associated with long audit partner 
tenure. Separate regression analyses for the just beats (just misses) last year’s profit are 
contained in Table 5, Panel B. Results also do not support the hypothesized deterioration 
in audit quality associated with long audit partner tenure, with TENURE>7 not significant 
for either the ''beats last year" or “misses last years” profit. 


Sensitivity Analyses 

The beats/misses results provide limited support for the hypothesized deterioration in 
audit quality with long audit partner tenure. This may be caused by the small number of 
long tenure observations for the TENURE>7 variable. Substituting TENURE>7 with a 
binary variable measuring partner tenure greater than five years (TENURE>5), long tenure 
is no longer approaching significance in the BEATS or MISSES. BE analyses (and continues 
to be not significant in the BEATS or MISSES. LYR analyses). This result suggests that 
observations for partner tenure of six or seven years behave more like medium than long 
tenure observations. 

To further investigate if the propensity for company earnings to just beat or miss specific 
benchmarks is associated with audit partner tenure, we substitute the two tenure variables 
(TENURES2, TENURE>7) with a single linear TENURE variable. We also control for 
instances of possible lower audit quality in the first two years of partner tenure by excluding 
the 447 TENURES2 observations. For the just beats last year's profit analysis, we find the 
effect of TENURE is positive and approaching significance (p — .077, one-tailed) while it 
remains insignificant for the other three benchmark analyses.? 

To explore the potential interaction between long partner tenure and AUDFIRM, we 
replicate the approach used for the going-concern audit opinion and AWCA analyses. Ex- 
cluding first- and second-year tenure observations and running TENURE as a continuous 
variable in place of TENURE 7, we find a significant AUDFIRM*TENURE interaction for 
one of the analyses, ће just beats last year's profit regression. TENURE is positive and 
significant (p — .008, one-tailed) and AUDFIRM*TENURE is negative and significant (p 
— .010, one-tailed). This suggests that non-Big 6 audit firm clients are more likely to just 
beat last year's profit?! The interaction term AUDFIRM*TENURE is insignificant for the 
other three analyses. 

Further sensitivity analyses are undertaken where the cutoff for BEATS or MISSES 
breakeven and prior-year earnings is changed from 2 percent to either 1 or 3 percent. 
Varying the cutoff to either 1 or 3 percent produces similar results to the primary analyses. 

In summary, the results of our analysis of tbe distribution of reported earnings provides 
some evidence suggesting a deterioration in audit quality associated with long audit partner 
tenure. Consistent with results from the going concern analysis (Section VI), sensitivity 


20 When we square the tenure variable to accentuate the effect of partner tenure, the coefficients for TENURE do 
not change in significance. | | 

31 For the 94 of 143 medium tenure observations in the range of +/—2 percent ANPTA, 70 of 101 (69 percent) 
of the companies audited by the Big 6 firms just beat last year’s NPTA, compared with 24 of 42 (57 percent) 
for companies audited by non-Big 6 firms. For the 37 of 52 long tenure observations, 16 of 26 (62 percent) of 
the companies audited by the Big 6 firms just beat last year’s NPTA, compared with 21 of 26 (81 percent) for 
companies audited by non-Big 6 firms. 
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analysis around beats (misses) last year’s profit suggests the deterioration in audit quality 
is associated with the non-Big 6 firms. 


IX. OVERALL SENSITIVITY ANALYSIS: COMPARISON OF THE THREE 
METHODS USING CONSTANT SAMPLE 

One of the possible reasons for the difference in the results between the three methods 
of analyses is the variations in sample size. We therefore rerun all the analyses using the 
sample of 965 used in the just beating or missing last year’s earnings, which involves 
deleting the 56 new listings from our sample of 1,021. We find that under this approach 
TENURE>7 remains significant for the going concern analysis (p = .038, one-tailed) for 
the reduced sample of 532 distressed companies contained in the sample of 965. With 
regard to the accruals models, while we are unable to rerun the AWCA because it requires 
last year’s sales as a base level of activity, we are able to run the current accruals model 
on the 965 observations. We find that TENURE>7 remains insignificant for both the signed 
and absolute measures of current accruals. With regard to the earnings benchmarks for 
breakeven, there is little difference in the result, with TENURE>7 remaining negative for 
the just misses (р = .087, one-tailed) and positive but insignificant for the just beats. It 
therefore appears that the differences in findings for the different measures of audit quality 
are not a result of different sample compositions. 


X. CONCLUSION 

One of the main policy initiatives that has been emphasized in recent times in order to 
address concerns about audit quality is mandatory rotation of the audit partner. This study 
provides evidence on the underlying rationale for an audit partner rotation policy: there is 
a reduction in audit quality associated with long tenure. Examining partner tenure in an 
environment where the name of the engagement partner is publicly disclosed on the audit 
report and at a time when a policy of partner rotation is not mandated allows for an 
empirical examination of evidence to support this rationale. We examine 1995, a time before 
audit partner tenure became a professional, regulatory or audit firm requirement in Australia. 

` We examine the relationship between audit partner tenure and three proxies for audit 
quality. These are, first, auditors' propensity to issue going-concern audit opinions for dis- 
tressed companies; second, an examination of the signed and absolute amounts of abnormal 
working capital accruals; and third, the extent to which key earnings benchmarks are just 
beaten and just missed. 

In examining going-concern modified opinions, we find that the auditor's propensity to 
issue such opinions do diminish over the audit partner's tenure, after controlling for other 
factors which influence this propensity. Sensitivity analyses suggest that the decrease in 
propensity to qualify with long tenure is associated with non-Big 6 firms. 

We find no evidence of an association of either the signed or absolute amount of 
abnormal working capital accruals with long audit partner tenure. This finding is inconsis- 
tent with Chi and Huang (2005) who find some support for the view that long audit partner 
tenure is associated with reduced earnings quality. Although both studies are undertaken in 
a pre-mandatory period, these differences may be a result of cultural disparities, and vari- 
ations in the measurement of audit partner tenure and earnings quality. 

In examining key earnings targets we find some evidence of just beating (missing) 
earnings benchmarks for long tenure observations. A lower proportion of clients miss break- 
even for long partner tenure observations, suggesting a greater ability to manage earnings 
in order to report a profit during later years of partner tenure. Also, sensitivity analysis 
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around: beats (misses) last year’s reported profit suggests deteriorating audit quality is more 
likely to be associated with the clients of non-Big 6 firms. When these results are combined 
with the findings of the abnormal working capital accruals analysis, it appears that mech- 
anisms. other than working capital accruals variations are being used to achieve these 
outcomes. 

In conclusion, for the measures of audit quality examining the auditor’s propensity to 
issue a going-concern audit opinion and just meeting (missing) earnings benchmarks, we 
find evidence consistent with a diminution in audit quality associated with long audit partner 
tenure. These results provide support for the introduction of a rotation policy after seven 
years of partner tenure. Sensitivity analyses show that the evidence of deterioration in audit 
quality seems to pertain mainly to non-Big 6 auditors. This is consistent with prior studies 
that provide evidence of higher quality audits provided by Big 6 firms (for example, Francis 
et al. 1999; Becker et al. 1998). 

The analysis is conducted during a period before there was mandatory rotaticn of audit 
partners. Therefore, we cannot necessarily expect that outcomes of the audit process ob- 
served over a long period of tenure are the same as those observed under any legislated 
period (1.е., five or seven years) of limited tenure. Our study is also not relevant to the 
consideration of whether a shorter five-year rotation period should be adopted, as has re- 
cently occurred in many jurisdictions. It is possible that the behavior of audit partners might 
be impacted by the knowledge that their tenure as audit partner will end at a specified date. 
Future research might investigate the impact on audit quality of the introduction of such a 
policy and any variations in its requirements. 
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ABSTRACT: At the heart of decentralization lies the notion that tasks are delegated by 
owners to managers who possess superior local Information. The extent of thls infor- 
mation asymmetry Is often an endogenous construct, as it is influenced by the owner's 
choice of internal accounting systems and the manager's investment in acquiring local 
expertise. In this paper, we explore how varying levels of pre-contract, asymmetric 
information affect the owner-manager relationship. We provide three- main sets of In- 
sights. First, we find that the owner's payoffs are Initlally decreasing, and strictly convex 
everywhere, In the quality of the manager's private information. The owner thus prefers 
to deal with either a perfectly informed or a perfectly uninformed manager, and we 
characterize conditions for either to be the preferred choice. Second, in contrast to 
recent work, we demonstrate that when information can be communicated Internally, 
the optimal strength of managerial Incentives unambiguously decreases as the man- 
ager becomes better informed. Third, we derive the surprising result that a self- 
interested manager does not always prefer to maximize his informational advantage. 
Our work has Implications for the optimal design of organizatlons, and for Internal 
accounting and control systems in particular. 


I. INTRODUCTION 

n issue of central interest in management accounting is the choice of information 
Av to aid managers in their decision making. In decentralized firms, decisions 

are delegated to managers who may have information superior to that of the owners 
of the firm. Some of this information superiority arises due to the manager's expertise 
regarding local conditions, such as his knowledge of consumer preferences in specific mar- 
kets or for specific product attributes. However, the relative information advantage of the 
manager is also a function of investments made by the firm in internal information systems 
(e.g., the installation of ERP systems or the acquisition of market survey data), as well as 
investments made by the manager in acquiring greater local expertise. In settings where 
the information generated by these systems can be monitored by the owner, it is clear that, 
holding cost considerations aside, more information is better. However, in decentralized 
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environments where the realized information signals are privately observed by managers, 
owners may. prefer to deal with managers whose information is less than perfect. The reason 
is that while managers can potentially use additional information to make better decisions 
for the firm, inducing them to do so against their self-interest may require excessive levels 
of additional compensation. 

In this. paper, we analyze the effects of contracting with decentralized managers who 
possess varying levels of pre-contract informational superiority. We first propose a measure 
of informational quality and validate it by demonstrating that first-best profit levels are 
increasing:and strictly convex in this metric. We then analyze the efficient second-best 
outcome and show that the owner’s profit is initially decreasing and strictly convex every- 
where as a function of the manager’s private information quality. An immediate implication 
is that the firm’s owner always prefers for the manager to be either perfectly informed or 
perfectly uninformed. Ignoring cost considerations, the owner will thus never offer the 
manager some, but not all of the information available. Moreover, giving an uninformed 
manger ап infinitesimal amount of additional information is always detrimental to the 
owner. Whether the owner will prefer for the manager to be perfectly informed ог perfectly 
uninformed depends on how much the underlying states of the world vary. When the states 
of the world vary greatly or there is a lot of uncertainty regarding the actual state, we show 
that the benefits.to informing the manager are sufficient to overcome any accompanying 
informational rents. On the other hand, when the states of nature are sufficiently similar, 
the.owner chooses not to inform the manager because this precludes the latter from earning 
any informational rents, whereas the consequences of the manager making an uninformed 
decision are negligible. 

We show that the above results hold regardless of whether the manager's contractible 
output is deterministic or stochastic. In both settings, we allow for full communication of 
the manager's information using menus of contracts. In the stochastic model, we employ 
menus of linear contracts, which allows us to measure the effect of information on incentive 
strength. In contrast to recent work that has studied this issue in settings with no commu- 
nication, we show that the optimal strength of managerial incentives always decreases as 
the manager's level of private information increases. 

We derive the rather unexpected finding that the manager's informational rents do not 
necessarily. increase as a function of his informational advantage. Both in the deterministic 
and stochastic settings, once the manager's level of private information is sufficiently large, 
his incentive to lie to the owner decreases, and the owner can reduce the level of rents used 
to keep the manager honest. We further show that while the owner and the manager would 
never select the same level of private information quality ex ante, the manager's desired 
choice at times.more closely approximates the welfare-maximizing level of private infor- 
mation quality.‘We thus identify the existence of settings in which there is an endogenous 
demand for thecowner to delegate or sell the right to improve informational quality to the 
manager. 

Given the:importance of information system choice in managerial accounting, it is not 
surprising that there is a'sizable extant literature on this topic. The vast majority of papers, 
however, have:looked at the choice of noisy performance measures to ex post evaluate the 
hidden actions:óf risk-averse managers (Banker and Datar 1989; Datar et al. 2001). In this 
setting, if the,principal can commit to his use of information, then the informativeness 
principle (Holmstrom 1979) generally allows one to determine whether additional ex post 
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information is of value.' Unfortunately, there is no analogous condition to value ex ante 
information. The accounting literature first examined the value of different information 
systems within an agency construct in Christensen (1981). Via a series of examples, 
Christensen (1981) finds that offering the agent more post-contract, pre-decision informa- 
tion (in the sense of Blackwell [1951]) may not necessarily make the principal better off. 
Following Christensen’s (1981) seminal contribution that more pre-decision information 
may be detrimental, the majority of the follow-on accounting literature has searched for 
sufficient conditions for making such information beneficial: (see, e.g., Baiman and Evans 
1983; Penno 1984; Baiman and Sivaramakrishnan 1991). In all of these models; the agent is 
risk-averse, and hence the principal tempers efficiency with the cost of paying the agent 
informational rents and risk premia. Because we are primarily interested in informational 
preferences, we assume all parties are risk-neutral, and examine the principal and agent’s 
preferred levels of informational advantage. With risk-neutrality, if the agent’s information 
arrives after signing the contract, then the principal can obtain the first-best outcome by 
“selling” the agent the firm. To rule out such pathological cases, we assume that the agent 
faces limited liability and possesses pre-contract information;.as in Sappington (1984). 
The economics literature has studied pre-decision information with risk-neutral agents 
(e.g., Lewis and Sappington 1991, 1993; Cremer and Khalil 1992; Cremer et al. 1998; 
Kessler 1998). This stream of work generally assumes that the onus to become asymmet- 
rically informed rests on the agent; the bulk of the analysis hence rests in providing incen- 
tives for the agent to seek pre-decision information and examining the ensuing equilibria. 
Unlike the economics literature, we allow for the possibility that the principal can influence 
the agent’s level of private information. Two other important distinctions are that we allow 
the agent to be imperfectly informed (which we show in many cases is his preferred choice), 
and we specify the quality of the agent’s private information to be common knowledge. 
More closely related to our paper are recent studies that have looked at the effect of 
private information on incentives (Bushman et al. 2000; Baker and Jorgensen 2003). Both 
model the presence of imperfectly informed agents, but neither allows for contracting based 
on reported information. In particular, Baker and Jorgensen (2003) show that’as an agent 
becomes more informed, the owner will want to raise the agent’s incentives. We show that 
in a full contracting framework with communication, the owner’s optimal response is rather 
to decrease the strength of incentives as the manager becomes more informed. In Bushman 
et al. (2000), the authors compare the principal’s payoffs when the agent is informed 
and his actions are unobservable (delegation) to the case where the agent is uninformed and 
his actions are observable (centralization). The authors show that although congruence 
between the agent’s performance measure and his individual task is beneficial, excessive 
congruence makes delegation less appealing. Both Bushman et al. (2000) and:Baker and 
Jorgenson (2003) assume that the agent’s information is post-contract, that is, the agent’s 
individual rationality constraint must be met in expectation over all states of the world. We 
assume that the agent’s information is pre-contract, such that the contract offered by the 
principal must satisfy the agent’s individual rationality constraint state-by-state. A recent 
paper examining the effect of imperfect, pre-contract information is Raith:(2004), which 
asks whether compensation should be based on noisy output when the agent’s effort is 
perfectly contractible. Raith (2004) finds that as the agent’s signal becomes more infor- 
mative, the owner will place more emphasis on noisy output and less emphasis on effort. 
We do not assume that the agent’s effort is contractible or even verifiable. Moreover, while 





! For a nice illustration of the failure of the informativeness principle when the principal cannot commit, see Arya 
et al. (1997). 
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in Raith’s (2004) two-state model, the owner’s expected profits are increasing in the agent’s 
pre-decision information, we find that the owner’s expected profits are initially decreasing 
in the agent’s level of private information and convex overall. 

In its examination of the owner and agent’s divergent preferences over private infor- 
mation, our study is most similar to Antle and Fellingham (1995). Using a model of re- 
source allocation with hurdle rates, the authors seek to determine the extent to which the 
owner and manager would benefit from making the agent's private information public. Their 
resource allocation model has a single hurdle rate, but several states of nature. Thus, com- 
munication between the owner and agent is coarse? and allows the agent to obtain positive 
rents in good states of the world. The authors find that divulging some of the agent's private 
information can either increase or decrease the agent’s informational rent? In their model, 
the agent's rents increase whenever the owner operates in states that were deemed unprof- 
itable without the agent’s private information, allowing the agent to-capture rents in more 
states. In our model, we vary the agent's private information by altering the correlation 
between the agent's private signal and the state of the world. As such, no new contracting 
opportunities arise as the agent's private information varies. 

The paper proceeds as follows. In Section II, we introduce and fully analyze the owner's 
preferences over information when the manager's output is deterministic. In Section Ш, we 
generalize our findings to the case of noisy output, using menus of linear contracts. Section 
IV examines the agent's informational preferences and the welfare-maximizing level of 
private information quality. Section V concludes. 


П. MODEL 
We model an agency relationship where all actions have been delegated by the share- 
holders (principal/owner) to a manager (agent). We assume that the manager has distinct 
tasks to perform. If he chooses effort level e, on task j, then his total disutility is given by 
c(e) where e = {е,,...,е,}. The manager's effort translates into revenues т for the owner 
according to the formula: 


k 
T= 2286 
ја 


where the vector В = (B,....,B,) is common knowledge. То simplify our analysis, we assume 
that c(e) = 3 Zi, е, where the parameter о is a random variable with support {0},...,0,}. 


Since с measures the cost of effort, larger values of с correspond to less productive man- 
agers, whereas smaller of values of с denotes good “types” of managers.* At the time of 
contracting, the value of o is unknown to both the owner and manager. The cwner and 
manager know only that с has support {0j,...,.0,} with corresponding probabilities 
{f(c,),--.f(o,,)}. However, before signing a contract and selecting his effort, we assume 
that the manager has private access to a signal s € {5,,...,5,}, which is related to o as 
follows: 


2 The communication is in fact binary—whether the agent's productivity is above or below the hurdle rate. 

3 See Antle and Fellingham (1995, Examples 1 and 2). 

4 All of our results continue to hold if we assume that the agent’s private information concerns the agent's output 
(revenue) rather than his costs. 
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where a Є [0,1] is an informational parameter. The conditional density of ø given s implies 
that the unconditional distribution of 5 is identical to с, that is Ко) = f(s) for i = 1,...,n. 
The following Lemma relates a to s and o. 


Lemma 1: The correlation between s and o is given by a. 


Clearly, the manager’s information is increasing in a, hence we shall say that the 
manager is more informed for larger values of a and less informed for smaller values of 
a. Note that if a = 0, then the signal s is uncorrelated with the true state of the world, с, 
and is thus entirely without information content. When a = 0, we shall say that the manager 
is perfectly uninformed. On the other hand, if а = 1, then s perfectly reveals o, in which 
case we label the manager as perfectly informed. The case of a = 1 coincides with the 
adverse-selection paradigm originating in Maskin and Riley (1984). We will initially assume 
that a is exogenously specified, but will later treat a as a choice variable.* 

The sequence of events is as follows. At time 0, the manager receives s, and at time 1 
the owner offers the manager a menu of contracts. If the manager accepts, then the manager 
communicates his choice of contracts from the menu to the owner. If the manager declines, 
then we assume both parties earn 0, and the relationship ends. At time 2, the manager 
selects his effort levels and the owner consequently earns т. At time 3, the owner pays the 
manager according to the terms of the initial contract. Figure 1 illustrates the entire se- 
quence of events. 





FIGURE 1 
Time Line 
0 1 2 3 
5 contract contract efforts т compensation 
revealed menu selected chosen realized made 
offered by 
by owner manager 


5 To illustrate our informational assumptions, consider a two-state example where the unconditional density i 18 
uniform. Then, the conditional probabilities in our specification reduce to the following intuitive expressions: 


a 
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The fact that the manager receives s prior to the contract being signed (pre-contract 
information) is an important feature of our model. As a natural counterpart to this speci- 
fication, we stipulate that any contract must guarantee the manager at least a minimum 
level of utility (which we set at zero without loss of generality} for every possible realization 
of s. Were the contract to be signed before the manager receives s (post-contract infor- 
mation), it is possible that the owner could “56” the firm to the manager for an amount 
equal to the expected profits of the firm assuming the manager realizes the mean cost о. 
For every realization of s, the manager would then select the conditional first-Eest action, 
and there would be no scope for communication. However, note that in this setting, the 
manager’s limited liability constraint would be violated for high (bad) signal realizations. 
On the other hand, if the owner cannot “sell” the firm to the manager ex ante, then all of 
our results transfer over to a post-contract information setting, because the owner faces the 
same constraints and objectives as in our model. 

We first consider the benchmark where the signal s is assumed to be jointly observable 
and contractible. To simplify notation, let б, = E[o|s = 5] and let e,, denote the effort the 
manager will put forth on task j when the public signal s; is observed. Given our conditional 
probability structure, we can write 0,(4) = ав, + (1 — ас. In this setting, when the owner 
observes signal 5, and pays the manager p, conditional on observing the correct level of 
revenues, she solves: 


k 
max > еВ; — Рі 
вірі j=1 

such that: 

б, 5 
Рт 2 > ey = 0. 
j=l 


The owner will clearly leave the manager with his reservation utility of zero in every 
state by setting р, = > 20 єї» This reduces the owner's problem to: 
k 8, k 
шах >) eybi- 5 Ха, 
еу j=l jul 
Solving this, the owner sets: 
_ В 
eff = 6, 


and we can thus write the owner’s expected profits as: 


arf (a) = өю 5 Код 


19 Ха) 


Note that the expected profits vary over a only via the (conditional) expected stetes of the 
world 8а). When the signals are uninformative, the conditional expected value of с is 
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simply the unconditional mean F, that is ¢{0) = с for i = 1,..,n. When the signal is 
perfectly informative, the signal perfectly reveals the state of the world, that is 61) = o; 
The following proposition demonstrates that in this setting, better information translates 
unambiguously to increased profits. 


Proposition 1: First-best profits are increasing and strictly convex in a. 


When a increases, the amount of information contained in the signal increases. In the 
first-best setting, the owner costlessly observes the signal s and can then make better de- 
cisions as to how much effort to ask of the manager. Higher levels of a thus result in higher 
levels of profits, as well as higher marginal profits. Note that the above result also serves 
to validate our choice of a as a parametrization of the level of information available. 

In a second-best setting, where only the manager learns s, it is less clear that the owner 
would always prefer higher levels of a. In particular, the owner may have to pay the 
manager informational rents in order for the manager to truthfully reveal (or make optimal 
use of) his private signal 5.5 

We now turn to the owner’s problem when only the manager can observe the signal s, 
and the manager’s output is deterministic. When revenues are deterministic, a mechanism 
where the manager reveals his information to the owner and the owner offers a contract on 
the basis of the revelation is equivalent to a mechanism without communication whereby 
the owner offers the manager a menu of revenue targets and corresponding payments. We 
assume that the owner employs the latter type of contract, and in particular specifies n 
(revenue, payment} pairs, {r, p;), from which the manager must choose. 

When the manager privately observes signal s = s, and “targets” revenue level r, he 
chooses his effort allocation across tasks as the solution to the following problem: 


A 


ô, È 
min 5, У, el 
such that: 

В е = г, 


where e; = (e;,....,e,,) is the effort vector chosen by a manager with signal 5 = s; Solving 
the above program, we find that the manager will expend effort on task j of: 


B 
a us Lt 
в.в” 


Let c{r) denote the minimum (expected) cost for a type i manager to generate г profits. 
Given the optimal effort allocation, we find that: 


$ In a post-contract information setting where the manager is committed to stay with the firm and has no minimum 
utility requirement, the owner would not have to pay any informational rents. In such a setting, the result in 
_ Proposition 1 implies that the owner would always prefer, if given the choice, to have as шине: of. а as 
possible. "IN (oM 
С ke лав 3.4 


desc T 


The Accounting Reb бу 200 ; 


684 Rajan and Saouma 


6 


X - ТҮМ 


72. 
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Given this expected cost of attaining a revenue target as a function of the manager’s signal, 
we can now specify the owner’s optimization problem: 


max 2 for, — р) (Р1) 
such that: . 

р- сб) =0 vi (1) 

р = Chr) = р, — chr) Vil (2) 


Constraint (1) ensures that the manager will accept the contract offered at time 1, regardless 
of the signal he obtained at time 0. Constraint (2) makes sure that the manager will truthfully 
reveal his signal to the owner by his choice of the appropriate {revenue, payment} pair. 

To give a meaningful answer to the owner’s problem, we make two additional 
assumptions: 


А1: The probabilities f(-) satisfy the following condition: 


F(oj) 
Хо) 





is increasing in i, 


where F(o;) represents the cumulative distribution function. 


А2: The states о, exhibit non-decreasing differences, i.e., (с, — є.) is weakly increas- 
ing in i. 


. Assumption 1 is standard and is satisfied by most common distributions, including the 
uniform distribution. It is the discrete analog to the usual assumption of log-concave dis- 
tribution functions (see Bagnoli and Bergstrom 2005). Assumption 2 asks that the relative 
dispersion of the “bad” states be at least as great as that of the “good” states. Because ап 
agent with signal s, is tempted to report signal s,,,, A2 bounds each agent’s temptation to 
misreport by his neighbor's temptation. Note that A1 places a restriction on the probability 
weights, whereas A2 restricts the support of с. Assumption 1 is used to prevent any pooling 
among the different types of managers, whereas A2 ensures that the managers with better 
signals are asked to produce more than those with less productive signals. A natural can- 
didate for o satisfying Al and A2 is when the states of the world are equally spaced and 
uniformly distributed. . 

We can now solve (P1). If the ex ante quality of the information signal is a, then let 
r(a) denote the optimal revenue targets issued to the manager when he reports signal s 
= $. 
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Proposition 2: Assuming A1 and А2, the solution to (P1) is given by: 


is (8:8)6:1(а) | 
ке C) E 1) +1 
Ко) 6, (а) 
а) 


п-1 
рКа) = р.а) + 2, В.В) (rXa) — пы (а)); 


ра) = с„(т„(а)). 





The solution to the second-best problem depends on a exclusively via (a). Recall that 
the expected state of the world conditional on s = 5, ба), is linear in a with endpoints 
6(0) = с and 041) = са, When a = 0, all the revenue targets are the same and the owner 
(in effect) offers a single contract for all different types. As a increases, the n different 
contracts grow further apart. Following the monotonicity of types, the target revenues are 
decreasing in i, such that good types are asked to generate more revenue than worse types. 
The worst type (s = s,) earns no rents, and the best type (s = s,) puts forth the same level 
of effort as in the first-best setting and earns positive rents. Moreover, the informational 
rents paid to the manager are increasing in type. The owner's profits are increasing in the 
manager's type, but also vary over the level of private information contained in the man- 
ager's signal. The following proposition characterizes the effect of changes in the manager's 
private information on the owner's expected profits: 


` Proposition 3: Second-best profits are decreasing in a at а = 0, and are strictly convex 
over the entire range of a. 


When a = 0, the owner offers a single contract and the manager earns no informational 
rents. As a increases and the contracts in the optimal menu grow further apart (the revenue 
targets diverge as do the payments), the manager begins to earn rents. Although increasing 
the manager's information helps him implement better decisions, for values of a sufficiently 
close to zero, these benefits are insufficient to warrant the higher rents necessary for the 
manager to truthfully reveal his signal to the owner. However, as a increases, the marginal 
benefit of informing the manager trumps the owner's marginal cost associated with paying 
informational rents; hence, the owner's profits are convex in a. Recall tbat in the first-best 
setting of Proposition 1, the value of additional information was increasing in the pre- 
existing level of information. Although it is impossible to disentangle the agency costs and 
the efficiency gains resulting from a change in a, Proposition 3 shows that the agency cost 
is sufficiently small that convexity in the principal's profits is preserved in the second-best 
setting. Moreover, Proposition 1 showed that the marginal benefit to the principal of in- 
creasing a when a = 0 is 0. Because the first- and second-best contracts are identical when 
a = 0, we can conclude that even when the agent is perfectly uninformed, there exists a 
positive agency cost associated with any increase in a. The agency costs associated with 
an informed agent and positive a are two-fold: the principal is paying the agent informa- 
tional rents, but additionally, she distorts the agent’s effort levels away from first-best in 
order to minimize the informational rents.’ 

An immediate implication of convexity is that, ignoring the costs of information ac- 
quisition, the owner will want the manager to either be perfectly informed (a = 1) or 


7 [n Section IV, we further examine the effect of varying a on the agent's informational rents. 
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perfectly uninformed (a = 0).8 An obvious question that arises is whether. we can charac- 
terize settings in which the owner prefers an informed to an uninformed manager. The 
following proposition identifies a necessary and sufficient condition for the owner to prefer 
an informed manager to an uninformed one. 


Proposition 4: If the owner can choose the level of information conveyed in s, she will 
make s perfectly informative if and only if: 


1 бг 
сл Fes. 1) 1 (3) 
Хо) No. | 
or, equivalently: 
B-B gp) а 
а 
is satisfied. 


To understand Condition (4), it is instructive to identify circumstances when it will not 
be met. When the agent’s signal is perfectly uninformative, the principal earns (B - B) 
se On the other hand, when the principal decides to offer the agent a perfectly informative 
signal, her expected profit is given by half of the agent's expected revenue target. When 
the states of nature do not vary significantly, the informed agent's revenue target does not 


vary significantly from the uninformed agent’s target of (В. В 5= | ; however, the informed 
agent’s targets have a distortion built in to minimize the agent’s ш А$ ien when the 
states of nature are relatively similar, 4E[r(1)] will tend to be less than (В · В) = because 


the incremental profits resulting from knowing s are insufficient to overcome the efficiency 
loss induced by the optimal contract. 

The following corollary demonstrates the above intuition explicitly by considering a 
setting where the states of nature are uniformly distributed and grow according to a geo- 
metric series. In this case, we show that the owner will prefer an informed manager when 
and only when the states of the world are sufficiently far apart. 


Corollary 1: Suppose that the states of the world are uniformly distributed with support 
{o,,....0,} where о; = cp"! for i = 1. Then, the owner prefers the man- 
ager to be perfectly uninformed if р < р(п), and she prefers the manager 
to be perfectly informed if p > p(n) for some 1 = р(п) x p(n) = 2. 


8 Lewis and Sappington (1991) derive a similar "all-or-nothing" result іп a Baron and Myerson (1982) model 
where the principal controls the probability with which an agent receives a perfectly revealing signal, as opposed 
to a null signal. In their setup, imperfect information ‘systems of the kind we study are not considered and, 
moreover, the principal relies upon an additional communication stage to determine whether the agent is perfectly 
informed. 
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When p is sufficiently close to 1 (recall that the states are increasing, hence p = 1), 
the states are relatively close to one another, and the owner prefers the manager to be 
uninformed. When p approaches 2, the states begin to differ greatly and the owner prefers 
the manager to be perfectly informed. Again, when the manager is uninformed, the owner 
offers a single generic contract that specifies a constant revenue target irrespective of the 
manager’s signal. This works well when the states are relatively close to one another, as 
the generic target is never too distant from the optimal target the owner would have chosen 
had she known s. Moreover, when the owner offers a single contract, she does not need 
the manager to communicate s and thus avoids paying any informational rents. On the other 
hand, when the states vary significantly, a single contract would specify a revenue target 
that is “significantly non-optimal" for many realizations of s. As such, when the states 
differ greatly, the owner prefers to issue individual targets according to the manager’s report, 
even when it involves paying the manager informational rents. 

A similar result holds when the states are uniformly distributed and the supports grow 
according to an arithmetic series. In fact, the intuition behind the similarity of states and 
the owner’s informational preferences can be generalized to any distribution of states. To 
do so formally, we must specify a metric that captures the extent to which the states are 
similar to one another. To this end, we next introduce the notion of linear mean-preserving 
contractions. 


Linear Mean-Preserving Contraction: For any random variable с with mean 0, sup- 
port (0,..,0,), and respective probabilities (f(c,),...,f(c,)), a linear mean- 
preserving contraction (Impc) is defined by a random variable o, with support 
[03,..,0,,) and respective probabilities (f(o,),....f(o,)), where о, = do, + (1 
— кб for à € (0,117 


A linear mean-preserving contraction (Impc) is the reverse of a mean-preserving spread, 
although we restrict the contraction to occur linearly. In particular, note that о’, has support 
с when А = 0 and o, has the same support as c when А = 1. Let L, denote the line with 
endpoints с and o, Then the i? element in the support of o, is a point on L, Moreover, 
this point moves closer to с for small values of А, and moves closer to 0, for larger values 
of АХ. We have plotted the supports of о, as a function of А in Figure 2. By considering 
linear mean-preserving contractions on an arbitrary random variable o, we are in effect 
parametrizing the similarity between states, such that the random variable o, is identical to 
g if = 1, and as А tends to 0, the support of а, collapses to с. Because с, collapses the 
support of с linearly to the mean, the expected value of о, is equal to с for any value of 
à. Moreover, because the expected state of the world is the only statistic that affects the 
owner's payoffs when the manager is perfectly uninformed, when the states of the world 
follow the distribution o,, the second-best profits (0) are constant over all А Є [0,1]. 
However, as the states grow further apart, having individualized contracts becomes more 
and more profitable and may eventually warrant providing the manager with information 
in spite of the additional costs incurred in doing so. The following proposition shows that 
linear mean-preserving contractions are sufficient to determine whether the owner prefers 
the manager to be perfectly informed or perfectly uninformed. 


? There may be some confusion as to whether c, is the random variable o, with А = 1, or whether o, is an 
element in the support of с. To avoid any confusion, we will explicitly label с; as a random variable unless we 
are referring to the constant. 
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Proposition 5: Suppose that the owner prefers the manager’s signal to be perfectly 
informative. Then, there exists а А Є (0,1) such that the owner prefers 
the signal to be uninformative when the states of nature follow the Impe 
а, for all А < Å. If the owner prefers the signal to be perfectly unin- 
formative, then she continues to prefer an uninformative signal when 
the states of nature follow the Impe o, for Х Е [0,1]. 


The proposition guarantees that given any distribution of the states of the world, the 
owner will always prefer to have an uninformed manager if all the possible states suffi- 
ciently resemble the mean c. Figure 3 plots an example of the owner's profits over a for 
various levels of Х. 


FIGURE 2 
Linear Mean-Preserving Contraction 


On С 


al 
a 


0 à 1 


We plot o, for i = 1,...,6. Note that в, is equal to the original mean о when à = 0 and g, is equal to the 
original support o, when А = 1. 


FIGURE 3 
Convexity and Linear Mean-Preserving Contractions 


т (a) 


ч) 0.3 0. 3 ze 
0.325 
0.275 
—1 a 


The The graphs plot a two-state еха plot а two-state example of the owner's expected profits over a for various levels of А with с, = 1, 
оз = 2.5, and Ка) = Ко) = 4. From left to right, the value of À decreases from 1 to 3 to 4. 
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Collapsing the states into o also affects the marginal profitability of offering a manager 
information. In the first panel of Figure 3, А = 1 and the owner’s profits are maximized at 
а = 1. As we decrease Х to 3 in the second panel, the owner's profits are still increasing 
for sufficiently large a, but not enough to warrant setting a = 1. In the last panel of Figure 
3, = 4 and the owner’s profits are always decreasing in a. In fact, the observation 
generalizes to any linear mean-preserving contraction: Proposition 3 and the proof of Prop- 
osition 5 directly imply that the principal’s marginal profits т?" (a) are increasing in А. As 
such, the benefits to increasing the agent’s private information are more pronounced when 
the states of nature are further spread out. For sufficiently small ^, the Impe o, will induce 
the owner's profits to be everywhere decreasing in a, i.e., she is worse off whenever the 
manager gains any additional informational advantage. 

We have thus far considered changes in the support of o that would cause the owner 
to change her informational preferences. A second possibility is that changing the proba- 
bilities associated with a fixed support could also cause the owner to switch her preferences 
over the choice of information system for the manager. We examine this next. 


Example 1: Suppose there are three states, (0,,05,0,] = {сту + Ка, + 2k}, with 
respective probabilities = = and x [0,1]. In this case, as 
x tends to 1, the owner becomes convinced that the true state is o, + k, 
whereas when x tends to 0, the owner believes that the true state of the 
world is either o, or o; + 2k, each with equal probability. To satisfy 
Assumption A1, we need x = V5 — 2 ~ 0.236; Assumption A2 is trivially 
met. Now, from Proposition 4, the owner prefers а = 1 to a = 0 if and 
only if: 


„ы! dum logra ey 


20 1+х +k 
: o, + k —— 2x 2-3 =i 


When x = 4, i.e., the states are uniformly distributed, equality holds in 


| 14 V7 . 1+ 
(5) if k = T Figure 4 fixes k = 1 


hand side of (5) as x varies from V5 — 2 to 1. 








c, and plots the left- 


As x tends to 1, the only possible state is с,, and communication becomes worthless 
as the first-best outcome can be replicated with a single contract. On the other hand, as x 
takes on smaller values, the owner becomes less and less certain of the true state and 
therefore the owner prefers to pay the manager rents in exchange for better information 





1 
regarding c. In particular, when k = ©, for all values of x Є [4,1], the principal 


prefers to go with a single contract and avoid paying rents. For values of x € [V5 — 2, 
31 there is sufficient uncertainty that the principal prefers to offer a menu of contracts and 
pay the manager the appropriate informational rents. 

Example 1 illustrates the difficulty of using traditional stochastic orderings as a way to 
determine the owner's preferences over the manager's level of information superiority. In 
particular, note that both the value of an informed manager, as well as the relative value 
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FIGURE 4 
Information Preferences and State Uncertainty 








Where the graph is positive the owner prefers a = 1, and in the negative region, the owner prefers a = 0. 


(relative to having an uninformed manager) are non-monotonic in x, despite the fact that a 
decrease in x represents a mean-preserving spread, a higher level of variance, and is riskier 
in the sense of second-order stochastic dominance. Therefore, a higher level of riskiness, 
per se, does not guarantee either that the absolute value or the relative value of having an 
informed manager is any greater. In the above example, note however that once the spread/ 
riskiness is sufficiently large, the owner prefers to employ an informed manager. We now 
demonstrate that this finding can be generalized, under surprisingly mild conditions, to any 
arbitrary n-point distribution. 


Proposition 6: For any arbitrary distribution satisfying Assumptions Al and A2, there 
exists a sufficient mean-preserving spread of the probabilities f(-) under 
which the owner prefers the manager to be fully informed, provided 
that а, = 20). 


In this section, we considered the effects of varying the manager's private information. 
We have shown that the owner's profits are initially decreasing and everywhere convex in 
the manager's private information. Hence, she will always prefer an uninformed manager 
to a poorly informed one, and will always prefer the manager to be either perfectly informed 
or perfectly uninformed. With regard to the latter, we have provided a full characterization 
of the circumstances under which the owner would prefer either outcome. In particular, 
sufficiently dispersed states of nature or probability levels over states imply a preference 
for a fully informed manager. 


III. NOISY REVENUE 
An important restriction in our analysis so far is the assumption that the outcome of 
the manager's effort was deterministic. As a consequence, we could restrict attention to 
menus consisting of revenue targets and attendant payoffs in order to control the manager. 
In this section, we analyze the impact of relaxing this restriction by considering settings 
where the manager's actions lead to stochastic revenue realizations. 
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Suppose the owner's revenue takes the: form п = В · е + є where the random variable 
£ has zero mean and finite, non-zero variance. The second-best contract analyzed in the 
previous section is then no longer feasible. One potential approach is to assume a specific 
functional form for the density of є, and solve for the optimal contract for that particular 
density. We pursue an alternative strategy, that of restricting attention to menus of linear 
contracts. A key advantage of our approach is that the contracts we study are robust to any 
density of =, i.e., they can be implemented even if the owner is unaware (or uncertain) of 
the precise distribution of the: noise in revenues. A second important benefit is that by 
characterizing the optimal linear contracts, we.are able to provide a direct comparison of 
our work to previous results on:incentive strength and information. Menus of linear con- 
tracts have been extensively examined in the accounting literature (see Kirby et al. 1991; 
Reichelstein 1992), although such models typically use a continuum of types, whereas our 
model includes a finite type space. 

The switch in settings from deterministic revenues to noisy revenues leads:to a reduc- 
tion in the owner's expected profits for the simple reason that menus of linear contracts are 
non-optimal in the deterministic setting. Linear contracts are costly in this environment 
because a type i agent can claim to be of type i — 1, and then select an effort level that 
generates greater payoffs than the principal’s preferred effort level e, ,. Note that if we had 
a continuum of types, then it is conceivable that the additional constraint.associated ‘with 
menus of linear contracts is costless, as the owner may already face the same constraint 
when solving for the optimal mechanism. Indeed, Rogerson (1987) demonstrates that, with 
a continuum of types and signals, one can restrict attention to menus of linear contracts, 
provided the payments to the manager in the deterministic revenue setting are convex in 
output. In our framework, however, even in a two-type setting where the payments to the 
manager are trivially convex in type, menus of linear contracts impose an additional cost 
to the owner, irrespective of whether the owner tries to implement the original second-best 
effort levels, or the optimal effort levels given a menu of linear contracts. In general, the 
optimal linear menu of contracts will implement different effort levels than the optimal 
second-best contract, even in the absence of output uncertainty. 

We consider linear contracts that consist of a fixed payment, x, and a share of firm 
revenues у, If the manager has received the signal 5 = 5, and subsequently chooses the 
contract pair (x,y,), then he solves: 


^ К 
б 
max УКВ е) + җ = 7 2,9 


As such, we find that a type i manager facing an / linear contract will put forth effort: 
эв, 
e; 8,' 


which will generate a total expected revenue of r,, = (B · B) y,/G,. When the owner offers 
a menu of linear contracts, she thus solves: 


max У, f(o)((1 — у), — x) (P2) 
xy i-i 


such that: 
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yrutx-cí(rn)e0 vi 
yrutx-crn)emyntx-corn) Vi. 


The first constraint above is identical to that found in the deterministic setting. The 
second constraint ensures that the manager always truthfully reveals his signal, even though 
he can subsequently choose from a continuum of effort levels.'? As in the deterministic 
setting, we must make two additional assumptions to guarantee that the solution to P2 is 
well behaved. We will again assume А1 and we need to slightly alter A2 to accommodate 
the stochastic setting. We will assume that the following modification of A2 is satisfied: 


А2*: The ratio c/c, , is increasing in i. 


Assumption A2* replaces A2 from the deterministic setting and serves to maintain the 
monotonicity of the effort schedule with menus of linear contracts. Note that A2* holds 
for a large number of situations, including the case where the support of o follows a 
geometric series. Assumption A2* is sufficient, but not necessary; the “natural” setting 
where o has an equally spaced support and is uniformly distributed continues to be ac- 
ceptable here, in spite of violating A2*. The following proposition gives the solution to 
P2. 


Proposition 7: Assuming A1 and A2*, the solution to P2 is given by: 








1. 

"ее й 
Ко; бүл 

х= Хүү + E а XD) (7) 

ху-- POE x (8) 


The solution to P2 is similar to that of P1: managers with better signals (smaller indices) 
put forth more effort than others and are also entitled to larger rents. The worst type of 
manager earns no rents, and the best type puts forth the same level of effort as in the first- 
best setting (i.e., when the signal is observed by the owner). However, the optimal menu 
of linear contracts distorts the manager's effort beyond the second-best contract found in 
the deterministic setting. To see this, let ej, be the effort put forth on task j by a manager 
with signal s = s, facing a menu of linear contracts, and let e, be the same effort when 
the manager faces the deterministic menu found in the last section. We have: 


10 Recall that in the deterministic setting, the manager did not need to communicate to the owner; it was sufficient 
for him to choose a revenue target that, in effect, communicated the manager's signal to the owner. With noisy 
outcomes, the manager cannot communicate his signal via revenue, because he can no longer ensure that a 
specific target level of revenue is achieved. Instead, the manager communicates his information to the owner by 
selecting a linear contract from the menu before selecting his effort levels. 
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The owner needs to distort the effort levels beyond those found in the deterministic 
second-best contract in order for the contract to satisfy the agent’s incentive compatibility 
constraint for all possible densities of є. In spite of the difference in effort levels between 
the deterministic and stochastic settings, our next result shows that the nature of the relation 
between a and the owner’s profits is unaltered. 


Proposition 8: Using the optimal menu of linear contracts, the owner’s second-best 
profits are decreasing in a at a = 0, and are strictly convex over the 
entire range of a. 


An important implication of Proposition 8 is that all of our results on the value of 
informing the manager continue to hold in the stochastic framework. Moreover, with menus 
of linear contracts, the owner still prefers an uninformed manager when the states of the 
world are relatively close to one another and prefers an informed manager when the states 
are spaced widely apart, as the next result demonstrates. 


Proposition 9: In the stochastic setting, if the owner can choose the level of information 
conveyed in s, she will make s perfectly informative when: 


and perfectly uninformative otherwise. 


The underlying intuition behind the owner’s preferences over a remains entirely un- 
changed between the deterministic and stochastic settings. As such, all of our results on 
linear mean-preserving contractions continue to hold. In particular, for any given random 
variable c, there always exists a linear mean-preserving contraction under which the prin- 
cipal will prefer a = 0 to = 1. For the example considered in Corollary 1, we find that 
when the support of а is given by (а, роуү,-4р" !o,), the principal will prefer a = 1 toa 
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= 0 whenever р = 3. Note that in the stochastic setting, the support of с needs to be 
further spread out than in the deterministic setting for information to be valuable. This 
follows from the fact that in the stochastic setting, the principal faces an additional con- 
straint in motivating the agent to select the correct effort level in addition to reporting 
truthfully. It can also be shown that Example 1 carries over to the stochastic setting, as 
does the subsequent existence result proving the owner’s preference for an informed man- 
ager under extreme mean-preserving spreads. 

In addition to being robust to any type of white noise, the use of menus of linear 
contracts allows us to explore the effect of information on incentives. Baker and Jorgensen 
(2003) is one of the first papers to explore the relationship between incentives and private 
information. In their model, a risk-averse manager’s private information cannot be com- 
municated to the owner. Instead, given the level of information contained in the manager’s 
signal, the owner offers the manager a single linear contract. As the slope of the linear 
contract increases, the manager’s effort level converges to the level the owner would have 
chosen had she known the manager's private information. The owner can recoup the cost 
of offering strong incentives via an upfront payment from the manager to the owner. How- 
ever, the owner does not offer the manager perfect incentives because deing so would 
require the owner to pay the manager a large risk premium, which is a deadweight loss 
that cannot be recouped. Baker and Jorgensen (2003) find that as the manager's signal 
becomes more informative, the marginal return to good decision making increases, which 
in turn induces the owner to offer stronger incentives to motivate the manager to make 
better decisions. Similar to Baker and Jorgensen's (2003) model, as a, the informativeness 
of the signal, increases in our setting, the value of making informed decisions increases. 
However, because we allow for communication via menus of contracts, the principal has 
more control over the decisions, and hence the extent to which decisions are informed is 
not necessarily increasing in incentive strength. The following result shows that in our 
model, as the manager's signal becomes more informative, it is in fact optimal for the 
principal to provide him with decreased incentives. 


Proposition 10: The slope of each contract from the optimal menu of linear contracts 
is weakly decreasing in a. The average slope is strictly decreasing in 
a for a > 0. 


The intuition behind this proposition rests in the IC constraint of P2. As a increases, 
i.e., the manager's information becomes more precise, the expected states of the world 
(6,,...,6,) grow further apart, which makes the linear contract (x,,,, у) appear more 
attractive to a manager with signal i. In order to keep a type i manager truthful, the owner 
must pay additional informational rents. However, the owner minimizes such rents by fur- 
ther distorting the effort level ultimately chosen by the manager via decreased incentives. 

The above finding shows that the owner will always choose to distort the manager's 
effort levels in response to increases in a. However, this does not imply that the mana- 
ger's rents are increasing in a. Instead, in response to an increase in a, it is possible that 
the owner distorts the manager's choice until his informational rents actually decrease. 
In the next section, we consider the manager's preferences over a; in addition, we solve 
for the welfare-maximizing level of private information and compare both to the owner's 
preferred choice of a. 
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IV. PREFERENCES 

Thus far, we have considered the choice of information systems only from the stand- 
point of the firm’s owner. As discussed in the introduction, it is often the case that the 
manager can influence his relative informational superiority over the owner. For example, 
the manager can choose to acquire better local knowledge, expertise, or additional data 
regarding specific markets, products, or production processes. Such data may be either 
unavailable to, or less interpretable by, the firm’s owners, thereby increasing the manager’s 
informational advantage. In this section, we begin by examining the manager’s preferences 
over the level of ex ante information quality. While it may seem obvious that a manager 
would always choose to maximize his informational superiority over the owner, we show 
that this is in fact not the case. | 

To keep the analysis tractable, in this section we restrict ourselves to only two equally 
likely states (n = 2). We begin with the deterministic setting and discuss the stochastic 
setting later. Suppose that с, = po,. From the proof of Corollary 1, we can deduce that 
the owner prefers a = 1 if and only if p = 2. From a social welfare perspective, the optimal 
value of a is that which maximizes the owner’s revenues minus the manager’s cost of effort. 
Formally, if we let W(a) denote welfare, then: 


п А 2 
wa) = до ria) - ХЕ?) 


1 1 2 
раа 


__ар-1)+р+1 
(Зар — 1) + р + 17/7 


where v is a positive constant. The following lemma shows that in the region a € [0,1], 
the function W(a) is maximized at either a = 0 or a = 1. 


Lemma 2: Social welfare W(a) is maximized at a = 1 if p = 1.25; if p = 1.25, then 
W(a) is maximized at a = 0. 


Similar to the principal’s preferences, welfare is maximized at the boundaries of a; yet, 
the two do not always coincide. In particular, for values of p € [1.25,2], the owner pre- 
fers the manager to be perfectly uninformed, whereas the welfare-maximizing solution calls 
for the manager to be perfectly informed. To understand the divergence, recall that the only 
difference between overall welfare and the owner’s payoffs are the informational rents paid 
to the manager. The manager’s rents can be expressed as R(a), where: 


R(a) = 2 Код, — c(r(a)) 


MR Cni === на 
M (ao, + (1 — aF + 2a(o, — су)? 
: a 
"P Go (бар — 1) + p + D 


The Accounting Review, May 2006 


696 - Rajan спа Saouma 


Here, т, and т are positive constants. The following result identifies the manager's рге- 
ferred level of private information quality, i.e., the one that maximizes his informational 
rents. 


Proposition 11: The manager prefers a = 1 for p = 2; for p > 2, the manager prefers 





222312 à _ 1+р 
the interior solution a = 36 = D € (0,1). 


A key implication of the above proposition is that for sufficiently large p, the manager 
wants to limit his amount of private information relative to the owner. In other words, even 
though the manager could costlessly choose to be perfectly informed, he prefers not to, and 
settles for an interior level of information quality. To see why, note that for sufficiently 
large a, the incentive compatibility constraint weakens; the manager realizes that submitting 
the wrong message results in heavily distorted revenue targets, so much so that he begins 
to develop a preference for telling the truth. As a consequence, the manager’s informational 
rents start to decrease in a beyond a threshold level, say 4. Moreover, the manager’s pre- 
ferred level of (interior) information quality à, decreases monotonically in p for values of 
p > 2, і.е., as the disparity in the set of possible state realizations grows. Because ап 
uninformed manager earns zero rents, we find that à > 0, however; in the limit, as p — ©, 
the lower bound on 4 is 1/3. 

A second important implication of the lemma is that the manager’s preferred level of 
a never coincides with the owner’s preferences. At the precise cutoff (р = 2) where the 
principal’s preferences switch from а = 0 to the welfare-maximizing a = 1, the manager’s 
preferences switch from a = 1 to an interior value à < 1. 

A final implication is illustrated in Figure 5, which shows the owner's choice of a and 
juxtaposes it over the welfare-maximizing solution and the manager's choice. Note that the 
welfare-maximizing level of a agrees with the owner's choice when p takes on extreme 
values, in particular when either p = 2 or p = 1.25. For values of p in the middle, i.e., р 
€ [1.25,2], the socially optimal level of a coincides with the manager's preferred choice. 
Put differently, if the owner will not choose the welfare-maximizing level of information 
quality, the manager will. As a consequence, because the contracts considered take a as 
fixed, total welfare will increase if the principal delegates the choice of ex ante information 
system quality to the manager when the spread between the states takes on intermediate 
values. 

Finally, we address the issue of delegation in the context of uncertain revenues and 
linear contracts. The following lemma recreates the above analysis for the case of stochastic 
revenues and optimal menus of linear contracts: 


Lemma 3: With the optimal menu of linear contracts, the owner prefers a = 0 if p 
= (1 + V2), and a = 1 otherwise. The manager prefers a = 1 for p 
+ 
= УЗ and a = 0 ~ V3 93 
at a = 0 for p = ДІ + V5); otherwise, а = 1 maximizes welfare. 





otherwise. Overall welfare is maximized 


As in the deterministic setting, with stochastic revenues and optimal menus of linear 
contracts, the manager reacts to sufficiently high levels of p (i.e., very disparate types) by 
deliberately restricting his quality of private information. In particular, for values of p 


t1 
> V3, the manager chooses a = (2 — V3) B 


21 « 1. The manager’s choice of a is 
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FIGURE 5 
Informational Preferences 


29 
e, 
» 
ө... 
попав», 
Жаалхэн лээ тааахаааава C 







Principal’s choice ее ее 
Agent’s choice +++ 
Welfare maximizing ——— 


1.25 2 р 


The socially optimal choice of a, in addition to the owner’s and the manager’s preferred choices, in the 
deterministic setting. 


strictly decreasing in p and converges to a = 2 — V3 as p — ©, A second similarity with 
the deterministic setting is the divergence of the manager and owner's ex ante informational 
preferences. It is important to note that this result does not follow from the fact that the 
manager wisbes to maximize rents and the owner seeks to minimize them. Instead, as a 
increases, the firm becomes (potentially) more profitable, as seen in Proposition 1, but in 
addition, larger distortions are introduced to limit the manager's rents. As such, there is no 
guarantee that increasing a does not increase both the owner's profits and manager's rents. 
Our result that the manager and owner disagree unambiguously is thus quite striking. 

А key difference from the deterministic setting is that, with stochastic revenues, there 
exists a region where neither the manager nor the owner wants to choose the welfare- 
maximizing level of private information quality. In particular, in the region p € [V3, 1 
+. М2], overall welfare is maximized by setting a to 1; however, the owner prefers a 
= 0 while the manager prefers an interior level of a. In this region, it can be shown that 
the agent's choice of a yields greater joint payoffs than the principal’s choice of a. Similar 
to the deterministic setting, overall welfare will thus increase if the principal can commit 
to allowing the agent to choose a before the two sign a contract. Note that in both the 
deterministic and stochastic settings, this result occurs for intermediate values of p. For 
extreme values of p (i.e., when the states of the world are either similar or sufficiently far 
apart), the principal prefers the welfare-maximizing level of a, and would not consider 
delegating the choice of information system quality to the manager. 


V. CONCLUSION 
In a setting where an owner delegates decisions to a manager, having a better-informed 
manager is not always preferred when the information is privately observed by the manager. 
In our model, we show that the owner's payoffs are initially decreasing and everywhere 
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strictly convex in the correlation between the manager’s signal and the true state of the 
world. As such, the owner will always prefer to deal with an uniformed manager to a poorly 
informed one, and will always opt for either an uninformed or a perfectly informed manager 
over a partially informed one. The owner’s preference among these extreme options is a 
function of the distribution of nature, in a manner made precise in our analysis. 

The above results have implications for the design of managerial accounting systems 
where an owner cannot interpret incoming information but can control the flow of infor- 
mation to her managers. We have shown that when the optimal effort level does not vary 
significantly, there is little use to informing the manager because doing so will cost the 
owner informational rents and only bring about small gains from more informed decision 
making. 

Our analysis also has implications in settings where an owner cannot control the flow 
of private information to her managers. In such a setting, we show that it is optimal for the 
owner to offer the manager weaker incentives as the manager's level of private information 
increases. Even more surprisingly, we show that a manager who has the option to increase 
his level of private information relative to the owner will often deliberately choose to curtail 
this advantage, because his informational rents are not necessarily increasing in the quality 
of his private information. Moreover, we demonstrate that the manager's preferred level 
of information quality often more closely matches the welfáre-maximizing level of 
information. 

Our findings hold in settings where the manager is able to achieve set revenue targets 
in a deterministic fashion, as well as when revenues are a stochastic function of the man- 
ager's effort levels. The majority of our results were generated with relatively few para- 
metric restrictions. We allowed for an arbitrary number of managerial tasks and associated 
effort levels, an arbitrary number of informational signals, and (with one restriction) any 
density function on those signals. One key aspect of our analysis, however, required ad- 
ditional restrictions. In particular, for reasons of tractability, we derived the informational 
preferences of the manager, as well as the results on disagreement over overall welfare, in 
a two-state setting. We believe that these results hold in greater generality though, and have 
verified them via simulations for the n — 3 and n — 4 cases. It would be desirable to prove 
these results for the general n > 2 case. 

When analyzing the owner's preferences, we introduced the notion of linear mean- 
preserving contractions. Using this construct, we were always able to distort the states of . 
the world o until the owner preferred an uninformed manager to an informed one. It is 
infeasible to create a similar linear distortion in the opposite direction to favor an informed 
manager without potentially including negative supports in the distribution of с. However, 
it may be possible to construct general nonlinear expansions that would achieve this ob- 
jective. With regard to the impact of varying densities, we demonstrated, under mild con- 
ditions, the existence of a mean-preserving spread that led to the owner's preference for an 
informed manager. We conjecture that the following, sharper characterization may be pos- 
sible: for an arbitrary discrete density, if the owner prefers an informed manager, then he 
will continue to prefer an informed manager for any mean-preserving spread of that density. 

Finally, while we have framed our analysis in terms of internal accounting systems, 
our results apply to any setting where incentive contracts are written among parties with 
asymmetric information. An especially compelling extension of our work is to buyer- 
supplier contracting, where the former, because of proximity to customers, typically pos- 
sesses superior demand forecast information (Cachon and Lariviere 2001). An important 
difference is that the demand information is communicated to a supplier who then makes 
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production decisions that affect the interests of both parties. In future research, it would-be 
of great interest to analyze such a model in full generality. 


APPENDIX 
Proof of Lemma 1 


Suppose that the unconditional means of s and с are given Бу 5 and ©, respectively. 
Denote the joint density of s and о by A(s, o). We have: 


соу, о) = 22 2, (s — Nor — €), о) 
= У > (и - Зо, — Sf (s)Prlo = оү | s = s, a] 


1-1 I=] 


-X«- sf) (a, meee ay, (o, — afe) 
- a>, (s, — 3o, — ФГ) 


Therefore we can write: 


Sa (5 - Do; — 955) 


corr(s, о) FO, 
Ia (з, — 92 f(s) JS. (c, — с)? Ка) 


Since s is simply a signal, we can choose the actual signals s; without loss of generality; 
in particular, we can set 5, = о, in which case corr(s, с) = a. 


Proof of Proposition 1 
Differentiating the expression for the owner's profit with respect to a yields: 


еъ BOB (c; — 0) 
wh (ај = SPD fo ELT, 
As such, note that: 
т” (у = -EPS ож - в) = 0. 
Moreover, we have: 
п Em 2 
т^" (а) = (B-B) >, Код HEX > 0. 
imi і 
Proof of Proposition 2 


We shall omit the dependence of г, p, and с upon a for this proof. Note that (2) implies 
that if (1) holds for i = n, then it must also bold for i < n, as: 
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Pi — chr) = p, — cír,) > p, — сі = 0. 


The first inequality above holds by the IC constraint, the second inequality holds be- 
cause 6, < б, for i < n, and the last inequality holds by the IR constraint, and is met with 
equality in the case of i = п. As such, we can, with no loss of generality, set p, = c,(r,). 
We will only consider the local upward incentive compatibility constraints of the form: 


Pi — ст) = ра — CAT 1) 
and will verify that the optimal solution to this problem satisfies the entire set of constraints 


(2). 
If the local upward IC constraints bind, then the optimal contract satisfies: 


ó 


Pi = ры: + XB- В) ГУ (r? — т. 


We can express this equivalently as follows: 


п-1 б 


Pi = р, + У Еко — та). (9) 


The owner’s problem can thus be re-stated as: 





п êr? "1 6 
max $ f(n- у вр - та). ша 


Abusing notation, let К(с,) = 0. Then, we can rewrite the owner's objective as: 


2 


Since our unconstrained objective is quadratic and concave in each r, the unique solution 
to our problem can be found via the first order condition, yielding: 





cbr Ôi s 
max 5 È| fex - Ви - o) Sr 


(В · В) 


п = у 
0,4) + бү 


1407) 
(0,- 
Кор! 
We claim that r, is decreasing in i. To prove this claim, it is sufficient to show that the 
denominator is increasing in i. Recall that Al assumes that F{o,)/f(o, is increasing in i, 
and б, is increasing in i by construction. Therefore, it suffices to show that б, — 6, , is 

increasing in i. The following statements are equivalent: 
6, — б, = 0,4, — 6, 
ac, + (1 - ajo — ас - (1 — ајб = ас, + (1 — ae - ав, — (1 — ас 


ас; з a0;_, = dO, , — ад, 2. 
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The last line above holds by A2. By construction, the solution satisfies local upward 
incentive compatibility. The following equivalent statements verify global upward incentive 
compatibility, i.e., that type i would never want to take a type i + 1 contract. 


Di — CAT) = pua — стр) 


р, == ры Cfr) + er) E (11) 
itl-1 б У ў 
> XB- В) В) (г? = ra) = Б B (r? — гр. (12) 


To go from (11) to (12), we use (9). The inequality in (12) holds since r? is decreasing 
in i whereas б, is increasing in i. 

Finally, we need to verify that type i would not want to mimic some type i — І. The 
following equivalent statements verify global downward incentive compatibility: 


p; = CAT) = Pi- (1-0 
Pi = pia — cru) + cfr) 


i-1 ^. 6 


9, 2. „2 
= 2 Xp. Па * 38.5 








(ri гі). 


The final inequality holds again by the monotonicity of 6? and r? in і. 


Proof of Proposition 3 


Label the owner’s unconstrained objective function as M(a, к). Again, label F(o,) = 0. 
Ignoring the scaling of (B · B), we can write: 


M(a, г) = 2, | fou (В), — Fo) 2 r? + F) бы | 
Let У(а) = max, M(a, г). The envelope theorem yields: 


vo = муа,т) = $ | re) & 9 9) 8 + е) SA | 


SUL Ui eco) 72-29 &94| 


ізі 


If a = 0, then г, = г for i = 1,... п; hence, we can write: 


VO = 2 >, Fs.) FZ со, 


where the last inequality holds by the monotonicity of o, in i. 
To show that profits are strictly convex over a, we first prove a useful mathematical 
fact: 
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Claim: Suppose G(a) = max, g(a, 2), and z(a) € argmax g(a, 2). Then if е (а, z(a)) 
= 0, С is convex in a. Moreover, if either g,,(a, z(a)) > 0 or z(a) is the unique 
maximizer of g(a, 2), then С is strictly convex in a. 
For arbitrary œ Є [0,1], let aa + (1 —a)a = а. Now, we have: 
ска) = g(à, 2(4)) = agla, 2(4) + (1 — a)g(a, z(2)) 
= og(a, ща) + (1 — a)gG@, ка) = aG(a) + (1 - а)ба). 
The first inequality holds by the convexity of g, and the second by the optimality 
of z(a). Clearly if either or both inequalities are strict, then G is strictly convex. 


Now, returning to our main result, note that the unconstrained objective function M is 
linear in a, i.e., M, (a, г) = 0. Since M is quadratic in each element of г, there is a unique 
maximizer of M, hence V"(a) > 0 by the above claim. 


Proof of Proposition 4 
By Proposition 2 we have «"*(0) = (В · В) = Now, the principal prefers the agent to 
be informed if any of these equivalent statements hold: 
(0) = т" (1) 
1 n 
(B-B ac = У Једи - 


n 


La > По ар - о, Ко, 1) 


a В) i-i 
в = $ с ы: uo) 
(8-8) 2= = > fon xs -В) 2 гі С 9,1) Ка) + га | Ко). (13) 


By Proposition 2, we can restate (13) as: 








1 B-B 
ape | Jio 


(B: B = È fon — у È rho) = 5 > nf). 


В-8)2:5 Š fon- 


Proof of Corollary 1 | 
Proposition 2 implies that the owner prefers а = 0 if any of the following equivalent 
statements hold: 





T^(0) = are 
(В · Вукр — D. 1 
20 (р" — p E 12 p71 + (p — Di) 
пр — 1 
(р" — р т 
m ^ 1 
D ERES E. gi > реза + Ф — Di uM 


When p = 1, it is clear that equality holds in (14). Note that: 
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lim 2 п? _ па — n) 
ра бр [1 +р+р+... + р! 2 
7808 1 


lim маи). 
р ао Ww 


By continuity, we find that for sufficiently small р the owner will prefer а = 0. There also 
exists p(n) such that, if p > p(n), the owner prefers а = 1. We will now show that р = 2. 
Recall from (14) that the owner prefers a = 1 if and only if: | 


п? п 1 
р ate = ЕО (15) 


A simple calculation reveals that the inequality in (15) holds for n = 2 and p = 2. To show 
that (15) holds for т = 3, note that the right-hand side is clearly increasing in n. It is 
therefore sufficient to show that the left-hand side is decreasing in n for п = 3. To see this, 
note that we can rearrange the left hand side of (15): 


WD 
1+р+р? +... + рт! р" 1 
Moreover: 
9 | р-1)| | (р - DnQ — 2р" + тїор[р]р") | аб) 
дп | р'- 1 (1 — р")? : 


A sufficient condition for the right-hand side of (16) to be negative is nlog[p] — 2 = 0, 
and since 3log[2] > 2, we find that the right-hand side of (16) is negative for n = 3 when 
p = 2. 


Proof of Proposition 5 


Define т(а, №) as second-best profits when the states of nature follow the random 
variable су. We will show that «55 (a, №) = то (Aa). From (10), second-best profits are a 
function of & and not o; therefore, it suffices to show that & (а) = б(Ха). To this end, we 
have: | | 


6,, (а) = ас, + (1 — ae 
= ада; + (1 — X)o) + (1 — aye 
= до + al ~ Хо + (1 — ae 
= ado, + (1 — ad)o = бах). 
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Proof of Proposition 6 
Consider a new density that places probability p(o;) on o, where: 
m С, Bu С; 
po) > Ко; с, — а, 
ріс) = 0 i€ (2,...,n - 1). 


p(o,) = У Код 219 
i=] т 





с 


п 1 


Note that 21; p(o) = 22, Кора, — 0)/ (0, — сү) + (в, — о) /(а, — оц) = Zhi Јо) 
= 1. Moreover, the mean is preserved as: 


> р(сдо, = р(а ја, + р(о,)о, 


= У JO). ои, = а) t a9; - o) 


i=l С, — ©, 


n 


x 2, Која = v. 


We will now show that the new distribution is а mean-preserving spread of Ше original 
опе. Let P(o) = 2j., p(oj). Recall that when two distributions share the same mean, Р is 
a mean-preserving spread of F if and only if F second-order stochastically dominates P. 
Moreover, if P and F cross just once, with P(o;) = F(a, for all i less than some threshold 
i* and Р(о) = F(o,) for i > i*, then F second-order stochastically dominates P (see 
Fishburn and Vickson 1978, section 2.16). Hence, it is sufficient to establish that F crosses 
P once from below to prove that P is a mean-preserving spread of F. To this end, we have 
P(a;) = plo) = Доу) + Zi-; Flo), — 0)/ (0, — 01) > Го) = F(o,). Because Р(о,) is 
flat for i < n, whereas F is increasing, to establish that Р and P cross once before o,, it is 
sufficient to demonstrate that P(o,_,) — F(o, ,) < 0. We have: 


n-i == 
Plon) — F(G,-1) = 2, fe = ) 


9, о 
n-l 
о о, 
Е ---4|«0, 
2 ло = 2 У 


Thus, P is а mean-preserving spread of F. To complete the proof, we must show that 
under the density p(-), the owner prefers an informed agent to an uninformed agent when 
20, = ос, From Proposition 4, this requires (letting p, = p(o): 

1-р, 1 
> — 
с 1 с 


1-р, 





(с, Бе о) + 9 


which is equivalent to any of the following statements: 
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D, (1 — рі)? з 1 
со о„—о+о(1—р) ро; + (1 - руб, 
pi(o, — русу) + (1 — pio, > 1 

сс, — opi) р бу + (1 ~ poo, 


Pio, — ріс? + (1 — руурлоз + рий — po, — руді) + (1 — роо, 
> 010, — Pio} 
(1 — pop, + pd — p2o,(o, — руду) + (1 — ружа, 
> (00, — род — P) + p) 
(1 — p,oip, + po, (0, — руду) + (1 — pio, > (010, — рб + p) 
(1 — pyoip: + pio, (o, — p10)) + poi + p) + сьо (РФ; 3)) > 0 
(1 — рудої + о,(о, — р101) + 01 + p) + оо „р: — 3) > 0 


01(1 — p, + КЕ — р) + 1 + p, + Kp, — 3)) > 0 (+= =) 
1 


(k — 2)(k – 1) > 0. 


Proof of Proposition 7 
The owner solves: 
max > Кожа — Wry — x) (P2) 
xy i- 
such that: 
yutx-cn)0 Vi (17) 
yir; tx СА) S ути + х СТ) VEL | (18) 


As before (17) will bind only for the worst type (i = п). We will solve the problem assuming 
that the local upward constraints in (18) bind, and then show that the optimal solution 
satisfies all the IC constraints in (18). 


As discussed in the text, a type i manager facing contract уг + x, will choose to 
generate an expected revenue of r,, = (В · В) y//6,. Since (18) binds for type n, set: 


_ _(B-B) 
х, = 28, у. 
As Ше local upward ICs bind for i < п, we can write: 


х= унин + Хна — СК) — Yiu + cru) 


= Xia, + E (Уба — УР. 


Substituting back into the owner's objection function, we can simplify P2 to: 
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max(B - a(S to) € О Y но) 2 - > ба.) 22 х xL) (19) 


Solving, we find that the owner optimally sets: 
1 


У; я ее 
Ко) _ б _ 
(52: | (= 1) иь 


We claim that у, is decreasing in i. Assumption A1 implies that F(o;)/ (с is increasing 
in i. Therefore, it is sufficient to show that 0,/0, , is nondecreasing in i. By A2*, c/o, , 
is increasing in i. If a = 0, then 6, = є, and we are done. If a > 0, then we can write: 





ô _ ас; + (1 — ae 2 о, + sS 
6,, ас + (засо ots 
Now, 6,/6, , is increasing in і if and only if any of the following equivalent statements are 
true: 
ô „д 
8,4 б; 
с, + $ ~ Tint S 
gi, + $7 су + $ 
(о, + S)(oj-2 + 5 = (о, + 5) = 0. (20) 


Now, (20) holds trivially for 5 = 0 by A2*. To see that (20) holds for 5.> 0, we can 
differentiate (20) in S to obtain: 





д 
as ((о, + So, + 5) – (а, + 59) = g, + 0,2 — 29,4 > 0. 


The inequality above holds by A2* and the following claim: 


Claim: Suppose a > b > с > 0 and a/b = b/c, then а + c — 25 > 0. 
To see this, note that a/b = b/c = ас > b*. Therefore a + c - 2 >а + с 
— 2Vac = (Va — Vc)? > 0. 


We now verify that the optimal contract satisfies global upward IC via the following 
equivalent inequalities: 


Х, Уты — Cru) = Ха + Year — Cu) 


а у) 
itt- 90 
X аа on 
= 1+1 
The last inequality above follows from the monotonicity of у, and б, in i. To verify 
downward IC, note that the following inequalities are equivalent: 
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X + угы — ст) E xat yedua- (0 


x, =X. + E ot. — yi) 





i-1 1 В 2 В 
=> + == = 7 
0 = > 26, Oa у) 26, ОТ у) 
The last inequality, again, holds by the monotonicity of б, and y, in i. 


Proof of Proposition 8 
Recall from Proposition 7 that the owner’s unconstrained objective ЭЛЭГ 18: 


Му, а) = (В. a(S Ga ә де) 20 ZID, ро) 2) 


2) 
26,4] 


Let V(a) = max, M(y. а). By the envelope theorem, V'(0) = My, 0). 





23 > (а, — T) Fo- 


муу, 0) = В: a(S (s, - s (fe) = Ск 2:25) 
іт] 
- Ево - DFe) gE), 
If a = 0, then б, = c and у, = y. Letting F(o,) = 0, we can write: 


5.0 = -ED ($ о, — are) - $ бо – Фа) 


[вх (5 (v, = DFE) + fle) — Ў (л - 222) 
QE > F(o,_ 


io; — о) < 0, 


where the last inequality holds by the monotonicity of а, in i. To see that V(a) is convex, 
we have: 


мова = 6-03 (ret ши - 3 re = 
1-1 


іч І 


: => У Xii З XY ут 
= 1B a(S nfo) 4 —299 нэ “Эр 2 Ti- ко. 2) + Es), 
where т, = (ода) — с). All the terms in the expression above are positive by the mono- 
tonicity of y, hence V"(a) > 0 by the claim in our proof to Proposition 3. The owner 
chooses a = 0 or a = 1, because she maximizes V(a), which is convex in a. 


The Accounting Review, May 2006 


708 Rajan and Saouma 





Proof of Proposition 9 
From Proposition 7, the principal’s profits when the manager is uninformed is given 
by (0) = EE Therefore, the principal prefers the agent to be perfectly informed as 
long as any of the following equivalent statements hold: 
w(0) = T 
2 
— 1 
pig Osp pTO 
ss a – 128 + У noy - Y На.) 20 
1 «У Под Суу, FO , Ес) а] 
772 > 73) * qe) Код iE А 
1) [, _ Ес) , Fo) ЁЛ | 
2925 о; |» mens Fo)” E HA a 
v f) _ 9 | , Fo) 29, 
E рь- sz a * fe) У ( a 


ale ае 
IA 


о = - 8) 
1 


For а = 1, the closed-form solution given for у, in Proposition 7 implies that 2203 
1 
= (1 - (0,/0*—1))(1 — (F(o)/f(a))). Therefore, we can rewrite the last line above ав: 


1 я. | 2 pic» 
s^ à 9; | я-я У: | 
1 ~ Дод 
е2 s, ba 
Тави] 
с а, 
Proof of Proposition 10 


From Proposition 7, we have: 


ED s 8 M — ~ ^ 
(5 Ко) 1) (6, ,(a)8,(a) — 6; ,(a)6a)) 


ya) = = 
F(oj) б, _ 
eta (FE - Je JE 


Note that we can write: 
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бабка) — бі (а)ба) = (ao;_, + (1 — aya, — 9) 
- (e, — Sas, + (1 — dg) 
= (о, — в. №. 


As such, we can re-express у (а) as: 


FO) _ _ 
es ye 45 
5 = 0, 


дають Із сл. 
А Fo) | б, _ 
st а (FS (è ) ш ) 


where strict inequality holds for all i > 1 since с, > o, , and F(a) > Ко). 


Proof of Lemma 2 
Note that: 
1 1 2 
Me) = » а ар 


__ар-1])+р+1 
(За(р—1)+р+17/ 
Therefore: 


we -vz-( 


4a(p — 1)Ga%(p - 1)? - (р + 1)? + 14а(р? c 
20, : 


(1 + 3a(p —1} + p)*(1 + a + p — ар)? 


The equation W'(a) = 0 has three solutions: a = 0 and the two solutions to the quadratic, 
За р — 1)? + 14a(p? —1) — (р + 1)? = 0. Consider first а = 0. We have: 


2wp — 1)? 





VOD па 
+ 
Hence W(a) must be initially decreasing at а = 0. The additional two roots are T ET 
+ 
(—7 + 2\/ 13), опе of which is clearly negative. Label а" = 52-0 (—7 + 2М 13). If 


а" = 1, then the function W(a) is decreasing over Ше entire interval [0,1]; if а" < 1, then 
the function W(a) is decreasing on the range [0,a*] and increases over [a*,1]. In either 
case, it is clear that the function is maximized at either a = 0 or a = 1. Plugging in these 
values into the closed-form solution for W(a), we find that W(1) = W(0) if and only if: 


(p — 1)*(4p - 5) = 


8p? — бр + 2 9 


which is equivalent to p = 1.25. 
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Proof of Proposition 11 
We will show that R(a) is single peaked on the interval [0,2(1 + p)/3(p — 1)]. Note 
+ 
that a = 3225 is the unique solution to R'(a) = 0. Moreover, we have: 
6(p — 1)Ga(p - 1) - Хр + 1)) 


Fr (1 +р + Зар у ’ 





ae D + 
which is negative in the range a € |» о 2010) As such, the manager will select a 
+ 
= ] whenever ОР EET = 1, ie, when р = 2. When p > 2, the manager's rents are 
S _ р+1 
ee = en 
Proof of Lemma 3 


Given uniform density, Proposition 9 implies that the owner prefers a = 1 overa = 0 
if and only if: 


: 1 пр — 1) 
2 а + 06-15 ро 


For п = 2, this reduces to the following equivalent statements: 


р’ — 3p? +p +120 
р=1+ \2. 


Therefore, the principal prefers a = 0 when р Е (1,1 + V2]. Now, denote the manager’ $ 
rents under фе menu of linear contracts by Аа) where: 


B- mot a(z- 


When n = 2, the rents simplify to: 


R(a) = -i)r (0,1). 


с 


1 9 
Ка) = XB-B 53 а(р р Па; 


_ Ф (p —1)о, ас; + (1 — Фо) 
2(B · В) (ао, + (1 — ajo)? 


E Ui naa dms, 


ЭВЭР) (s +) вар + (1 — а) + ЮУ 


к Ка + 19 p - а) 
(р +1 + афр – D»' 


where к is a positive constant. Differentiating, we have: 
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' Ri(a) = к р - 1)? + (p + 1)? — 4a(p? — 1) 

| (p + 1+ ар - 1))* ` 
The numerator above is positive at a = 0 and decreases in a for a € [0,1]. f p = V3, the 
numerator of К;(1) is non-negative, ee that ка) > 0 for a Є [0,1); as such, the 
manager would prefer a = 1. For p > V3, there is a unique interior solution at which the 
first-order condition is met. Solving, we find that the manager’s optimal choice is given by 

+ 

a-Q- УЗ)» г су 

Overall M can be expressed аз Ка), where: 


ЛЭН) 62 У) о). 


‚ЖЕ m 


With n — 2 and geometric states, we have: 


" 1 1 6,06, — 6) 
та) = в) ЇЕ TU 8 ) 





Taking the derivative yields: 


Яка) = 


c 260 — [ар — 1» — ЗаХр — Пр + 1) + 7a(p — Тр + 1 — (1 + p 
(1 + a + p(1 — а)) 1 — a + ра + 4) 


Note that W;(0) = 0, but У 0) = —2(p —1)?/(p + 1)? < 0, so W;(a) is decreasing at a 
= 0. Moreover, the cubic in the numerator of У/ (а) has a unique real root; hence, W,(a) 
changes direction at most once over a € [0,1], at a global minimum. Therefore, Уа) must 
be maximized at either a = 0 or a = 1. Comparing W,(0) with У/(1) shows that СГ) 
= W,(0) if and only if p = (1 + Уз). 
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stream of literature argues that this result is attributable to a more general growth effect 
and that growth-related factors such as diminishing returns to new investment explain 
the lower persistence of accruals (e.g., Fairfield et al. 2003a; Cooper et al. 2005). We 
provide new evidence indicating that temporary accounting distortions are a significant 
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L INTRODUCTION 

nderstanding the properties of accruals is arguably one of the most important goals 
| | of financial accounting research. The primary objective of financial accounting is 
to provide information that is useful to investors. Accrual accounting has emerged 
as the accepted method of achieving this objective. Accrual accounting centers on the 
identification and measurement of assets and liabilities, with accruals representing changes 
in noncash assets and liabilities. A natural goal for financial accounting research is to 
investigate how accruals facilitate investor decision making. In an attempt to address this 
question, Dechow (1994) provides evidence that accrual accounting earnings are superior 
to cash accounting earnings at summarizing firm performance. In follow-up research, Sloan 
(1996) shows that the accrual component of earnings is less persistent than the cash com- 
ponent of earnings. Sloan’s (1996) results suggest that while accrual accounting is superior 
to cash accounting, the accrual component of earnings should receive a lower weighting 
than the cash component of earnings in evaluating firm performance. He attributes this 
result to the greater subjectivity involved in the estimation of accruals. Sloan (1996) also 
finds that investors do not appear to assign a lower weighting to accruals, leading to sig- 
nificant security mispricing. He concludes that investors do not fully comprehend the greater 
subjectivity involved in the estimation of accruals, causing them to make flawed investment 

decisions. J 

Subsequent research extends Sloan’s (1996) work on the lower persistence of the ac- 
crual component of earnings. This research can be classified into two broad streams. The 
first stream of research builds on Sloan’s (1996) conjecture that the lower persistence of 

‚ the accrual component of earnings arises because the estimation of accruals involves greater 
subjectivity. For example, Xie (2001) shows that the lower persistence of the accrual com- 
ponent of earnings is attributable to the **abnormal"" component of accruals, defined using 
the modified Jones (1991) model of normal accruals. He concludes that the lower persist- 
ence of accruals is primarily attributable to opportunistic managerial discretion. Dechow 
and Dichev (2002) and Richardson et al. (2005) (hereafter, RSST) show that the lower 
persistence of accruals is consistent with the existence of estimation error in accruals. These 
papers conclude that the lower persistence of the accrual component of earnings is attrib- : 
utable to temporary accounting distortions arising from accrual estimation error. Note that 
such distortions could arise both from unintentional errors in forecasting future benefits and 
obligations and from intentional managerial manipulation. 

The second stream of research argues that accruals are correlated with economic char- 
acteristics, such as firm growth, and that these correlated economic characteristics are re- 
sponsible for the lower persistence of the accrual component of earnings. In particular, 
Fairfield et al. (20032) (hereafter, FWY) show that the lower persistence of the accrual 
component of earnings extends from the working capital accruals considered by Sloan 
(1996) to growth in noncurrent balance sheet accounts. They suggest that the lower per- 
sistence of the accrual component of earnings arises from the interaction of firm growth 
with the lower rates of economic profits associated with diminishing marginal returns to 
increased investment. Variants of this explanation are embraced by Titman et al. (2004), 
Anderson and Garcia-Feijoo (2006), and Cooper et al. (2005). FWY also propose that the 
lower persistence of the accrual component of earnings may result from the interaction of 
firm growth with the conservative bias in generally acceptable accounting principles. 

The objective of this paper is to provide additional evidence on the role of accounting 
distortions in explaining the lower persistence of the accrual component of earnings. We 
begin with a series of numerical examples that illustrate the impact of accounting distortions 
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on accruals and earnings. The key insight emerging from these examples is that the observed 
empirical relation between accruals and earnings persistence is consistent with the existence 
of temporary accounting distortions arising from accrual estimation error. Next, we provide 
two sets of empirical tests examining the role of temporary accounting distortions. First, 
we decompose accruals into a "growth" component and an "efficiency" component. The 
growth component reflects accruals that are attributable to growth in output, while the 
efficiency component reflects accruals that are unrelated to growth in output. The effects 
of diminishing marginal returns to increased investment should be captured by the growth 
component of accruals. Our results indicate that both the growth and efficiency components 
contribute to the lower persistence of earnings. These results indicate that diminishing 
marginal returns do not provide a complete explanation for the lower persistence of the 
accruals, suggesting that accounting distortions play a significant role. 

Our second set of tests examines the association between accruals and SEC enforce- 
ment actions for alleged earnings manipulations. We show that firms subject to SEC en- 
forcement actions have abnormally high accruals at the time of the alleged earnings ma- 
nipulations and unusually low accruals following the alleged earnings manipulations. These 
results are consistent with the opportunistic use of managerial discretion to generate ac- 
counting distortions that temporarily inflate accruals and earnings. 

Overall, our analysis and results suggest that temporary accounting distortions provide 
the most compelling explanation for the lower persistence of the accrual component of 
earnings. Moreover, our SEC enforcement action tests suggest that some of these distortions 
arise from the opportunistic use of managerial discretion. At the same time, we cannot rule 
out a supplementary role for growth-related explanations, such as diminishing marginal 
returns to new investment. 

Our findings have important implications for financial accounting, because they indicate 
that the lower persistence. of accruals is at least partially attributable to distortions arising 
from accrual accounting. Accrual accounting involves a trade-off between the increased 
relevance associated with accruing future benefits and obligations and the reduced reliability 
associated with estimating the amounts of these future benefits and obligations. Our results 
suggest that the reduced reliability associated with accrual accounting results in temporary 
accounting distortions that have a significant impact on earnings persistence. The current 
push by accounting standard setters toward “Таг value" accounting is likely to increase the 
frequency and magnitude of accounting distortions, leading to a reduction in the persistence 
and reliability of earnings. | 

The remainder of the paper is organized as follows. Section II provides a more detailed 
discussion of accruals and their implications for earnings persistence: Section Ш develops 
numerical examples that illustrate the effects of accounting distortions on the properties of 
accruals and earnings. Section IV describes our data, Section V presents our results, and 
Section VI concludes. 


П. RESEARCH DESIGN 
Alternative Explanations for the Lower Persistence of the Accrual Component 
of Earnings 
Our objective in this paper is to discriminate between alternative explanations for the 
lower persistence of the accrual component of accounting earnings (where earnings are 
deflated by an appropriate measure of invested capital and thus expressed as an accounting 
rate of return). Formally, this result states that y, < 0 in the following equation: 
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RNOA,,, = Yo + Y, RNOA, + YACC, + з, (1) 
where: 


RNOA,,, =.accounting rate of return for period t+1; 


RNOA, = accounting rate of return for period 5 
ACC, = accrual component of the accounting rate of return for period t; and 
vı = period t+1 innovation in RNOA,,, (assumed to be mean zero and uncorrelated 


with RNOA, and АСС). 


From an intuitive perspective, this result manifests itself in the form of the following em- 
pirical regularity: 


Empirical Regularity: Firms with high (ow) accruals in the current period tend to 
have a high (low) accounting rate of return in the current pe- 
riod, but tend to experience significant reductions (increases) 
in their accounting rate of return in the subsequent period. 


Panels A and B of Figure 1 illustrate this empirical regularity using our sample and variable 
definitions (we detail our sample formation and variable definitions later in the paper). This 
figure reports event-time plots for firm-years belonging to extreme accrual deciles in event- 
year 0. Panel A of Figure 1 plots the mean value of the accrual component of earnings for 
each of the extreme accrual deciles. This plot illustrates that the accrual component of 
earnings is rapidly mean reverting. Panel B of Figure 1 plots the mean value of the ac- 
counting rate of return for the extreme accrual deciles. This plot illustrates that firms in the 
high (low) accrual decile tend to have a high (low) accounting rate of return in event period 
0, but tend to experience significant reductions (increases) in their accounting rate of return 
in the subsequent period. This is the regularity that we seek to explain in this paper. This 
regularity has proven to be robust across-a variety of samples and variable definitions (see 
Sloan 1996; FWY; RSST). 

Existing research offers two broad classes of explanations for the lower persistence of 
the accrual component of earnings. Following Sloan (1996), one line of research argues 
that it results from accounting distortions associated with the lower reliability of the accrual 
component of earnings. Following FWY, a second line of research argues that it is more 
likely to result from economic factors associated with growth in real investment and 
less likely to result from accounting distortions. The primary goal of our paper is to provide 
additional evidence that accounting distortions are a significant contributing factor to this 
empirical regularity. 

Existing evidence relating to alternative explanations is indirect and inconclusive. Pa- 
pers claiming to provide evidence in support of the accounting distortion explanation in- 
clude Xie (2001), Dechow and Dichev (2002), and RSST. Xie decomposes accruals into 
“normal” and “abnormal” accruals using the modified Jones model (see Dechow et al. 
1995). He finds that the lower persistence of earnings is primarily attributable to abnormal 
accruals. Based on this evidence, he concludes that the lower persistence of accruals is 
attributable to accounting distortions arising from earnings management. Dechow and 
Dichev (2002) attempt to isolate temporary. accounting distortions by extracting the resid- 
uals from regressions of accruals on past, present, and future cash flows. They demonstrate 
that the lower persistence of the accrual component of earnings is concentrated in these 
residuals. RSST decompose accruals along broad balance sheet classifications and rank 
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these classifications based on their assessment of the reliability of the underlying accrual 
categories. They show that the lower persistence of the accrual component of earnings is 
primarily attributable to accrual categories that they deem to be less reliable. 

Papers proposing economic explanations for the lower persistence of the accrual com- 
ponent of earnings include FWY, Cooper et al. (2005), and Khan (2005). FWY show that 
the lower persistence of the accrual component of earnings extends from the working capital 
accruals considered by Sloan (1996) to growth in noncurrent operating assets. They suggest 
that the lower persistence of the accrual component of earnings arises from the interaction 
of investment growth with the lower rates of return from diminishing marginal returns to 
increased investment. In follow up research, Cooper et al. (2005) show that FWY’s results 
extend to the growth rate in total assets. Following FWY, they also conclude that dimin- 
ishing marginal returns to new investment is the most likely explanation. Finally, recent 
research by Khan (2005) and Zach (2005) argues that accruals proxy for risk. Khan (2005) 
argues that firms with high accruals better enable investors to smooth intertemporal con- 
sumption patterns, while firms with low accruals expose investors to distress risk. Accord- 
ingly, firms with high (low) accruals earn lower (higher) equilibrium profits and stock 
returns. 

FWY also propose another possible explanation that combines both accounting distor- 
tions and economic growth. They argue that the widespread use of conservative accounting 
methods combines with growth in real investment to cause the lower persistence of the 
accrual component of earnings. We refer to this type of accounting distortion as a permanent 
accounting distortion, because it results from the consistent application of biased accounting 
methods. An example of such an accounting method would be the requirement to imme- 
diately and consistently expense research and development costs. Such permanent account- 
ing distortions are subtly different from the previously discussed temporary accounting 
distortions. Temporary accounting distortions arise from transitory estimation error in ac- 
cruals. Accruals represent estimates of future benefits and obligations. As such, they are 
more prone to estimation error than actual cash receipts or disbursements. An example of 
accrual estimation error would be the accrual of the allowance for uncollectible accounts 
on credit sales. A range of possible amounts could be feasibly booked, but only one of 
these amounts will reflect subsequent uncollectibles. All other amounts will contain tran- 
sitory estimation error that is reversed once actual collections are known. We refer to this 
type of error as a temporary accounting distortion, because it results from transitory esti- 
mation error rather than the consistent application of biased accounting methods. 

" In order to provide evidence on the role of accounting distortions, our research design 
employs three complementary approaches. First, to illustrate the effects of accounting dis- 
tortions on accruals and profitability, we develop a series of numerical examples. These 
examples assume constant marginal returns to investment, and so help us to isolate the 
impact of accounting distortions on accruals and profitability. They also allow us to deter- 
mine which combinations of accounting distortions and growth are consistent with empirical 
regularities in accruals and profitability. Second, to discriminate between accounting dis- 
tortions and diminishing marginal returns to new investment, we decompose accruals into 
a component attributable to growth in real investment (“growth component") and a com- 
ponent attributable to changes in the efficiency with which existing investments are em- 
ployed ("efficiency component"). The effects of diminishing marginal returns should be 
limited to the growth component. Thus, to the extent that the efficiency component drives 
the observed properties of accruals, we infer that accounting distortions explain these prop- 
erties. Third, to investigate the possibility that temporary accounting distortions are gen- 
erated by managerial manipulation, we examine the relation between accruals and SEC 
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enforcement actions for alleged earnings manipulations. The SEC issues an enforcement 
action in cases where it believes management has intentionally manipulated earnings in a 
manner that violates GAAP. Hence, evidence of a relation between SEC enforcement ac- 
tions and accruals is consistent with temporary accounting distortions resulting from man- 
agerial manipulation. 


Defining Accruals 

Following Healy (1985), most academic research defines accruals as the change in 
noncash working capital less depreciation expense (e.g., Sloan 1996; Xie 2001; Thomas 
and Zhang 2001; FWY). This definition, however, omits accruals relating to noncurrent 
operating assets/liabilities and financial assets/liabilities. RSST propose a broader definition 
of accruals that incorporates all differences between GAAP earnings and cash earnings. 
They argue that in the absence of accrual accounting, only cash and owner’s equity would 
appear on the balance sheet, because all other accounts are manifestations of the accrual 
accounting process. Under their definition, changes in all noncash asset and liability ac- 
counts represent accruals of expected future benefits or obligations. Total accruals therefore 
include working capital accruals (related to changes in noncash current operating asset/ 
liability accounts), noncurrent operating accruals (related to changes in noncurrent operating 
asset/liability accounts), and financing accruals (related to changes in noncash financial 
asset/liability accounts). 

RSST’s expanded definition of accruals contrasts sharply with the definition of accruals 
used by FWY. FWY follow the established convention of referring only to working capital 
accruals as ‘‘accruals,” and label noncurrent operating accruals as a generic form of 
"growth." FWY show that the change in net operating assets, which combines working 
capital accruals and noncurrent operating accruals, best captures the low-persistence com- 
ponent of earnings. They offer this result as their primary evidence against accounting- 
based explanations. The key difference, however, between working capital accruals and 
noncurrent operating accruals is that the future benefits and obligations associated with the 
latter usually take longer to be realized. Both are manifestations of the accrual accounting 
process, involving subjective estimates of future benefits and obligations. Moreover, as 
illustrated by the accounting debacle at WorldCom, noncurrent accruals are susceptible to 
earnings manipulation.! Thus, FWY's evidence does not help discriminate between ac- 
counting and economic-based explanations, because it is consistent with both. 

RSST argue that both working capital accruals and noncurrent operating accruals are 
of low reliability and, hence, particularly prone to temporary accounting distortions. Con- 
sistent with these arguments, they show that the working capital and noncurrent operating 
accrual components of earnings are less persistent than the cash flow and financing accrual 
components of earnings. Thus, RSST reinterpret FWY's growth in net operating asset vari- 
able as a more comprehensive measure of accruals and FWY’s results as a manifestation 
of temporary accounting distortions in accruals. Following RSST, we incorporate noncurrent 
operating assets into our definition of accruals, and we refer to the sum of working capital 
accruals and noncurrent operating accruals as total operating accruals. Thus, our definition 
of accruals includes all operating accruals, but excludes the more reliable financing accruals. 


WorldCom overstated its net income by more than $10 billion between 2000 and 2001, primarily by capitalizing 
line-operating costs in property, plant, and equipment. Dechow and Schrand (2004) and Nelson et al. (2003) 
also document evidence of earnings management through the inappropriate capitalization of operating costs. 
Surveys of earnings restatements by Huron Consulting Group also indicate that inappropriate capitalization of 
costs has been a major reason for restatements in recent years. 
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For expositional ease, we simply refer to this construct as the accrual component of earnings 
(denoted ACC) throughout the remainder of the paper. Note that given this definition of 
accruals, the corresponding definition of cash flows (i.e., the difference between earnings 
and accruals) is the standard finance textbook definition of free cash flow.” Exact variable 
definitions in terms of Compustat items are provided in Section IV. 


Growth versus Efficiency Decomposition 


FWY argue that diminishing marginal returns to new investment contribute to the lower 
persistence of the accrual component of earnings. In support of this argument, they provide 
a reference to Stigler (1963, 54). In this reference, Stigler asserts that: 


There is no more important proposition in economic theory than that, under competi- 
tion, the rate of return on investment tends toward equality in all industries. Entrepre- 
пеш will seek to leave relatively unprofitable industries and enter relatively profitable 
industries, and with competition, there will be neither public nor private barriers to 
these movements. 


Note that FWY measure growth at the firm level while Stigler’s (1963) arguments apply at 
the industry level. FWY’s argument therefore requires that firm-level growth has a signif- 
icant influence on industry-level output. This would only be the case if either the firm 
produces a significant proportion of industry output, or growth is strongly correlated across 
firms in the same industry. 

In order to investigate the diminishing returns to new investment explanation in more 
detail, we decompose accruals into a “growth” component and an “е сїепсу” component. 
Accruals can increase either because of real investment growth (whereby more operating 
assets lead to increased production and sales), or because of reduced efficiency (whereby 
more operating assets are required to generate the same level of production and sales). 
Diminishing marginal returns to new investment will be reflected in the growth component 
of accruals. The efficiency component of accruals, in contrast, will pick up either accounting 
distortions or less efficient use of existing capital (see also, Jansen and Yohn 2003). For 
example, assume that inventories increase without corresponding increases in sales or pay- 
ables. This increase will manifest itself as a decrease in the efficiency component of ac- 
cruals. The two most likely explanations for such an increase are that inventories are over- 
stated (ie., a temporary accounting distortion) or that the firm is managing physical 
inventories less efficiently. 

Existing research recognizes that real investment growth necessitates increased oper- 
ating assets and so will be associated with higher accruals (e.g., Jones 1991). Such accruals 
are often termed “nondiscretionary” or "normal" in the academic literature. Nondiscre- 
tionary (discretionary) accruals are traditionally modeled as the fitted values (residuals) 
from regressions of accruals on measures of growth in operating activities, such as sales 
growth.? Recent studies by Xie (2001) and Chan et al. (2006) find that the lower persistence 
of the accrual component of earnings is primarily attributable to the discretionary compo- 
nent of accruals. However, the statistically oriented decompositions used in these studies 
are subject to misspecification. First, they impose an ad hoc linear structure on a nonlinear 


2 Recall that free cash flow equals earnings, plus noncash charges such as depreciation, less changes in working 
capital, less investment expenditures. These adjustments to earnings correspond to the current and noncurrent 
operating accruals tbat we incorporate in our definition in accruals. 

* See Dechow et al. (1995) for a more detailed discussion of the efficacy of models used for isolating the 
discretionary and nondiscretionary components of accruals. 
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relation. Second, they require the estimation of key parameters (e.g., the sensitivity of 
accruals to sales growth). We use a parsimonious algebraic decomposition of accruals that 
mitigates these problems. 

The starting point for our growth versus efficiency decomposition is our measure of 
total operating accruals (ACC), defined as the change in net operating assets deflated by 
lagged operating assets. Net operating assets (МОА) is defined as the difference between 
noncash operating assets and operating liabilities. This definition corresponds to FWY’s 
"growth in net operating assets" variable and RSST's ‘“‘total operating accrual’ variable.’’* 


ACC, = (NOA, — NOA,_,)/NOA,_,. (2) 
We next define efficiency using the standard net operating asset turnover ratio (AT): 

AT, = Sales, NOA, (3) 
The following decomposition then follows with some simple algebra:? 


ACC, = ANOAJNOA, | 
= ASales/Sales., — AAT/AT, — (ASales,/Sates, ,)*(AAT,/AT) 


Sales Change in (4) 


Growth Efficiency Interaction 





Note that this decomposition is an algebraic identity that does not require the estimation 
of any additional parameters. The decomposition highlights the two determinants of oper- 
ating accruals. First, accruals are directly related to sales growth. If asset efficiency remains 
unchanged, then sales growth will lead to proportional increases in accruals. Second, ac- 
cruals are negatively related to efficiency. In the absence of sales growth, reductions in 
asset efficiency lead to proportional increases in accruals. The decomposition also contains 
an interaction term, indicating that a simple linear decomposition is not appropriate when 
sales growth and efficiency changes are correlated. A nonzero correlation is a distinct 
possibility. Economies of scale imply a positive correlation between sales growth and 
change in efficiency. Alternatively, diminishing marginal returns to new investment imply 
a negative correlation between sales growth and change in efficiency (because increased 
sales lead to lower sales prices, which lead to lower turnover). 

In line with the discussion at the beginning of this section, growth in real investment 
should be reflected by the growth component of accruals. In contrast, temporary accounting 
distortions, such as the overstatement of inventory, should be captured by the efficiency 
component of accruals. Correlated changes in both growth and distortions will be picked 
up by the interaction. Note that one potential limitation of these tests is that the growth 
component of accruals could also be affected by accounting distortions in sales (e.g., over- 
statement of accounts receivable). Thus, we cannot unambiguously attribute the properties 
of the growth component to diminishing returns. We can, however, rule out the possibility 
that properties of the efficiency component are attributable to diminishing returns to new 


4 FWY and RSST use average total assets as the deflator. We use lagged net operating assets, as this allows for 
a parsimonious algebraic decomposition. As a practical matter, however, our empirical results are qualitatively 
identical if we deflate by either end-of-year NOA, average NOA, or average total assets. 

5 See the Appendix for the proof. 
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investment. Thus, evidence of a significant role for the efficiency component of accruals in 
explaining the lower persistence of earnings is consistent with a significant role for tem- 
porary accounting distortions. 


Ш. THE EFFECT OF ACCOUNTING DISTORTIONS 

We now turn to our numerical examples in order to better understand the impact of 
accounting distortions on accruals and profitability. Our examples are adapted from related 
examples in Penman (2001) modeling the effects of permanent accounting-distortions. We 
extend Penman’s (2001) examples to consider both permanent and temporary accounting 
distortions.$ Recall that the empirical regularity we seek to explain is the positive association 
between accruals and contemporaneous profitability and the negative association between 
accruals and subsequent changes in profitability. FWY refer to Penman's (2001) examples 
to support their claim that this empirical regularity is consistent with the interaction of 
permanently conservative accounting and growth.? 

Our numerical examples are based on a generic investment opportunity generating a 
constant economic return of 10 percent. In order to give the investment opportunity the 
characteristics of the business operations of a typical firm, we assume that the investment 
generates sales revenue and requires operating expenditures. The investment is made (in 
cash) at the end of the current period and generates sales revenues (in cash) at the end of 
the next period in the amount of 165 percent of the investment. Operating expenses are 
also incurred (in cash) at the end of the next period in the amount of 55 percent of the 
investment. These assumptions generate cash flow at the end of the next period equal to 
110 percent of the current period investment, to give the 10 percent economic rate of return. 
The appropriate accounting to capture the underlying economics of this simple investment 
opportunity should be clear. The benefits to investment made at the end of period ¢ are all 
realized at the end of period t+1. Hence, all investment in period г should be capitalized 
at the end of period t and expensed at the end of period t-- 1. Such accounting will always 
result in accounting rates of return that correspond to the economic rate of return of 10 
percent (see subsequent examples for details), and we refer to such accounting as “neutral 
accounting." 

In order to study the effect of accounting distortions on accruals and profitability, we 
introduce "conservative" and “aggressive” accounting alternatives to the.neutral accounting 
described above. Our conservative accounting method only capitalizes 80 percent of the 
investment made at the end of each period, despite the fact that all:the benefits from 
the investment are received at the end of the subsequent period. An: example of such 
accounting would be a situation where 20 percent of the investment is. determined to rep- 
resent research and development costs, and hence immediately expensed.under GAAP. On 
the other hand, our aggressive accounting method capitalizes 120 percent of the investment 
made at the end of each period. This is accomplished by capitalizing some of the cash 


$ We stress that our numerical examples are not intended to provide evidence on the role of diminishing marginal 
returns to investment in explaining the lower persistence of the accrual component of earnings. Neither Penman 
(2001) nor FWY use examples to illustrate the impact of diminishing marginal returns to investment. Note that 
if we construct a model in which we incorporate an assumption of diminishing marginal returns, it is intuitively 
obvious that returns will diminish, so such a modeling exercise is redundant. 

7 Penman (2001, 561) refers to an example illustrating that growth in real investment interacts with conservative : 
accounting methods to understate profitability in high accrual/investment periods and overstate profitability in 
subsequent periods. This example is opposite to the empirical regularity that we are trying to explain. Penman 
(2001, 565) makes the (correct) point that steady-state investment combines with conservative accounting meth- 
ods to permanently inflate accounting rates of return. This example is also not descriptive of our empirical 
regularity. 
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operating costs for that period in addition to the entire investment. An example of such an 
accounting practice would be the capitalization of maintenance costs that are not expected 
to generate benefits in subsequent periods. Note that under both the aggressive and con- 
servative accounting methods, we assume that all amounts capitalized in any period are 
expensed at the end of the subsequent period.’ 

The second dial that we turn in our example is the growth rate in investment. Recall 
that FWY propose an explanation based on the interaction of real investment growth with 
permanent accounting distortions. We therefore construct our example to cover seven con- 
secutive periods and we consider how investment growth affects accounting rates of return 
over these seven periods. We model a positive growth scenario, initially starting with a 0 
percent growth rate for two. periods, increasing to a 5 percent growth rate for the next 
period, increasing to 20 percent for one period, then declining to a 5 percent growth rate 
for one period, followed by a 0 percent growth rate for the final two periods. We select 
this pattern of growth as it is representative of the sequence of growth rates experienced 
by a typical “high-growth” firm, with the period of 20 percent growth representing the 
high growth period (see discussion of Table 3 in Section V). Growth tends to be rapidly 
mean reverting, so firms with unusually high growth in investment in one period tend to 
have a declining pattern of growth in surrounding periods (see Nissim and Penman 2001). 
We also briefly describe the corresponding negative growth scenario that is the mirror image 
of the positive growth scenario with growth initially starting at 0 percent for two periods, 
followed by —5 percent for one period, —20 percent for one period, —5 percent for one 
period, and 0 percent for the final two periods. 

Within each of the growth scenarios, we consider five different accounting regimes: the 
neutral accounting scenario, two aggressive accounting scenarios, and two conservative 
accounting scenarios. The reason that we have two aggressive and conservative accounting 
scenarios is that we want to consider the effects of both permanent and temporary account- 
ing distortions. Recall that permanent accounting distortions result from the consistent ap- 
plication of biased accounting methods. These are the types of distortions considered by 
Penman (2001) and FWY. Temporary accounting distortions, in contrast, result from tran- 
sitory estimation errors in accruals. These are the types of distortions considered by Dechow 
and Dichev (2002) and RSST. Our numerical example reports the patterns in profitability 
(RNOA, measured as operating income on lagged operating assets) and the accrual com- 
ponent of profitability (ACC, measured as the percentage change in net operating assets) 
for each of the above scenarios. 

Table 1 details our examples. We begin with an initial investment of $100 and then 
track tbe financial performance over the subsequent seven periods. We introduce any tem- 
porary accounting distortions in the fourth period, labeling this period as event period O 
and other periods relative to this period. The first row of Panel A simply lists the growth 
rates in investment. The second row lists the dollar amount of investment in each period 
that results from sequentially applying these growth rates to the initial investment of 100 
in period —4. The third and fourth rows show the amounts of operating costs and investment 
costs that are capitalized in each period. In the neutral accounting scenario, none of the 
operating costs are capitalized and 100 percent of the investment costs are capitalized in 


* We assume the benefits from the investment and associated amortization of capitalized investment are all realized 
in the subsequent period. These assumptions are clearly arbitrary and are chosen to be a reasonable but not 
exhaustive representation. The key takeaways from the examples are robust to assuming that the investment 
benefits and amortization expenses occur over multiple future periods. The only major qualification is that the 
accounting distortions reverse over more than one future period. This finding is comforting, since the impact of 
accrual reversals on earnings also takes more than one period in our empirical data (see Figures 1 and 2). 
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TABLE 1 
Numerical Examples Illustrating the Effect of Accounting Distortions 
on Profitability and Accruals 


Numerical examples illustrating the relation between profitability (RNOA) and accruals (ACC) for 
firms experiencing a combination of positive growth and accounting distortions. The numerical 
examples are based on a simple firm making an investment in period t that generates sales in 
period t-- 1. Sales аге 165% of the period t investment and operating expenses are 55% of the 
period t investment (both incurred in cash in period +1). In the neutral accounting example, 100% 
of investment is capitalized in period t and expensed in period t-- 1. In the permanently 
(temporarily) conservative accounting case the firm always (in year 0 only) capitalizes 80% of the 
investment in year t and expenses it in t-- 1. In the permanently (temporarily) aggressive accounting 
case the firm always (in year 0 only) capitalizes 120% of the investment in year t and expenses it 
in 1+1. RNOA is measured as Operating Income divided by lagged МОА. Accruals (ACC) are 
defined as the change in NOA divided by lagged NOA. 





-4 53 2 Rua _0б а 
Panel A: Neutral Accounting with Positive Growth 
Growth Rate in Investment 0.0% 0.0% 5.0% 20.0% 5.0% 0.0% 0.0% 
Investment 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 


Capitalized Operating Costs 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
Capitalized Investment Costs 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 











Total Capitalized Costs 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 
Sales 0.00 165.00 165.00 165.00 173.25 207.90 218.30 218.30 
Operating Expense 0.00 55.00 55.00 55.00 57.75 69.30 7277 72.77 
Investment Expense 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
Amortization Expense 0.00 100.00 100.00 100.00 105.00 126.00 132.30 132.30 
Operating Income 0.00 10.00 10.00 10.00 10.50 12.60 13.23 1323 
Net Operating Assets (NOA) 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 
RNOA 10.0% 10.0% 10.0% 100% 10.0% 10.0% 10.0% 
Accruals (ACC) 0.0% 009 5.0% 20.0% 5.0% 0.0% 0.0% 
Panel В: Permanently Conservative Accounting with Positive Growth 

Growth Rate in Investment 0.0% 0.0% 5.0% 20.0% 5.0% 0.0% 0.0% 
Investment 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 


Capitalized Operating Costs 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
Capitalized Investment Costs 80.00 80.00 80.00 8400 10080 105.84 105.84 105.84 








Total Capitalized Costs “80.00 80.00 8000 84.00 100.80 105.84 105.84 105.84 
Sales 0.00 165.00 165.00 165.00 173.25 207.90 218.30 218.30 
Operating Expense 0.00 55.00 55.00 5500 5775 69.30 72.77 72.77 
Investment Expense 20.00 20.00 20.00 2100 2520 2646 2646 2646 
Amortization Expense 0.00 80.00 80.00 80.00 84.00 100.80 105.84 105.84 
Operating Income -20.00 10.00 10.00 900 630 1134 1323 1323 
Net Operating Assets (МОА) 80.00 80.00 80.00 84.00 100.80 105.84 105.84 105.84 
RNOA 12.5% 12.5% 113% 7.5% 113% 12.5% 12.5% 
Accruals (ACC) 0.0% 009 5.0% 200% 5.0% 0.0% 0.0% 


(continued on next page) 
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TABLE 1 (Continued) 








-4 -3 -2 -1 0 1 2 3 

Panel С: Permanently Aggressive Accounting with Positive Growth 

Growth Rate in Investment 0.0% 0.0% 5.0% 20.0% 5.0% 0.0% 0.0% 
Investment 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 
Capitalized Operating Costs 2000 20.00 2000 21.00 2520 2646 2646 26.46 
Capitalized Investment Costs 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 
Total Capitalized Costs 120.00 120.00 120.00 126.00 151.20 158.76 158.76 158.76 
Sales 0.00 165.00 165.00 165.00 173.25 207.90 218.30 218.30 
Operating Expense —20.00 35.00 35.00 34.00 32.55 42.84 46.31 46.31 
Investment Expense 0.00 000 0.00 000 0.00 0.00 000 0.00 
Amortization Expense 0.00 120.00 120.00 120.00 126.00 15120 158.76 158.76 
Operating Income 20.00 10.00 1000 11.00 14.70 13.86 13.23 1323 
Net Operating Assets (NOA) 120.00 120.00 120.00 126.00 151.20 158.76 158.76 158.76 
RNOA 83% 83% 9.2% 11.7% 92% 83% 83% 
Accruals (ACC) 0.0% 0.0% 5.0% 20.0% 5.0% 0.0% 0.0% 
Panel D: Temporarily Conservative Accounting with Positive Growth 

Growth Rate in Investment 0.0% 0.0% 5.0% 20.0% 5.0% 0.0% 0.0% 
Investment 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 
Capitalized Operating Costs 000 000 000 0.00 0.00 0.00 0.00 0.00 
Capitalized Investment Costs 100.00 100.00 100.00 105.00 100.80 132.30 132.30 132.30 
Total Capitalized Costs 100.00 100.00 100.00 105.00 100.80 132.30 132.30 132.30 
Sales 0.00 165.00 165.00 165.00 173.25 207.90 218.30 218.30 
Operating Expense 0.00 55.00 55.00 55.00 57.75 69.30 72.77 72.77 
Investment Expense 0.00 0.00 0.00 0.00 25.20 0.00 0.00 0.00 
Amortization Expense 0.00 100.00 100.00 100.00 105.00 100.80 132.30 132.30 
Operating Income 0.00 10.00 10.00 10.00 -—14.70 37.80 1323 1323 
Net Operating Assets (NOA) 100.00 100.00 100.00 105.00 100.80 132.30 132.30 132.30 
RNOA 10.0% 10.0% 10.0% -14.0% 37.5% 10.0% 10.0% 
Accruals (ACC) 0.0% 0.0% 50% —4.0% 313% 00% 0.0% 
Panel E: Temporarily Aggressive Accounting with Positive Growth 

Growth Rate in Investment 0.0% 0.0% 5.0% 20.0% 5.0% 0.0% 0.0% 
Investment 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 
Capitalized Operating Costs ` 0.00 000 0.00 0.00 25.20 0.00 000 0.00 
Capitalized Investment Costs 100.00 100.00 100.00 105.00 126.00 132.30 132.30 132.30 
Total Capitalized Costs 100.00 100.00 100.00 105.00 151.20 132.30 132.30 132.30 
Sales 0.00 165.00 165.00 165.00 173.25 207.90 218.30 218.30 
Operating Expense 0.00 55.00 55.00 55.00 32.55 69.30 7277 72.77 
Investment Expense 0.00 0.00 0.00 0,00 0.00 0.00 0.00 0.00 
Amortization Expense 0.00 100.00: 100.00 100.00 105.00 151.20 132.30 132.30 
Operating Income 0.00 10.00 10.00 10.00 3570  —12.60 13.23 13.23 
Net Operating Assets (NOA) 100.00 100.00 100.00 105.00 151.20 13230 132.30 132.30 
RNOA 10.0% 10.0% 10.090 340% -83% 1009 10.0% 
Accruals (ACC) 0.0% 0.0% 50% 44.0% -12.5% 0.0% 0.0% 
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each period. The next five rows present the periodic income statements. Sales in each period 
are equal to 165 percent of the prior period investment and operating expenses are equal 
to 55 percent of the prior period investment. Investment expense is zero (because 100 
percent of investment costs are capitalized) and amortization expense is equal to 100 percent 
of the prior period investment. Operating income is derived by subtracting expenses from 
sales. The next row represents the net operating assets at the end of the period and is simply 
equal to the amount of costs that have been capitalized for that period (remember that all 
capitalized costs are assumed to be expensed at the end of the subsequent period, so we 
do not have to consider operating assets from previous periods). The final two rows report 
the resultant trends in profitability (RNOA, measured as operating income on lagged oper- 
ating assets) and accruals (ACC, measured as the percentage change in net operating assets). 

Panel A of Table 1 reports the neutral accounting scenario. RNOA is constant and equal 
to the economic rate of return of 10 percent for all periods. This base case illustrates that 
in the absence of accounting distortions, the accounting rate of return simply equals the 
economic rate of return. Note also that in this base case scenario, ACC is synonymous with 
the growth rate in real investment. 

Panel B of Table 1 introduces serian conservative accounting into the mix. This 
example gives us our first look at how growth in investment and accounting distortions 
interact to produce changes in accounting rates of return. It shows that combining perma- 
nently conservative accounting with growth in investment causes the accounting rate of 
return to deviate from the economic rate of return of 10 percent. With the exception of 
year 0, the RNOA is always higher than the economic rate of return of 10 percent. In year 
0, however, the RNOA is only 7.5 percent. To understand why this happens, note that 
conservative accounting can affect both the numerator (i.e., operating income) and the 
denominator (i.e., net operating assets) of the RNOA computation. The denominator effect 
is to always understate investment, leading to an upward bias in reported RNOA. The 
numerator effect depends on investment growth. With zero growth in investment, there is 
no net effect on the numerator. This is because the impact of the origination of new con- 
servative accounting exactly offsets the impact of the reversal of old conservative account- 
ing. With positive growth in investment, the numerator is depressed relative to the neutral 
accounting scenario. This is because the origination of new conservative accounting de- 
presses income by more than the reversal of old conservative accounting boosts income. If 
the growth rate is greater than the economic rate of return, then the numerator effect will 
dominate the denominator effect, causing RNOA to drop below the economic rate of return 
(1.е., period 0 of Panel B). Similar effects are illustrated by the examples in Penman (2001, 
567 and 571). Finally, note that as in the previous example, ACC continues to be synony- 
mous with the growth rate in real investment. This is because permanently conservative 
accounting understates net operating assets by the same percentage in each period, so the 
growth rate in net operating assets is not affected. 

FWY hypothesize that the lower persistence of the accrual component of earnings is 
consistent with a combination of growth in investment and conservative accounting meth- 
ods. However, the patterns in accruals and profitability in Panel B of Table 1 are clearly 
inconsistent with the empirical regularity described in Section П (i.e., periods with high 
accruals having high contemporaneous RNOA, and lower subsequent RNOA). Instead, we 
see that the increase in accruals in period 0 is associated with low contemporaneous RNOA 
and higher subsequent RNOA. 

Panel С of Table 1 considers a combination of permanently aggressive accounting with 
positive growth. Recall from Panel B of Table 1 that permanently conservative accounting 
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combines with positive growth to reduce RNOA. For analogous reasons, permanently ag- 
gressive accounting combines with positive growth to increase RNOA. This happens because 
the origination of new aggressive accounting increases income by more than the reversal 
of old aggressive accounting reduces income. This causes both accruals and RNOA to be 
overstated in year 0 and drop back down in year 1. These patterns are consistent with the 
empirical regularity we are trying to explain. High accruals are associated with high con- 
temporaneous RNOA and lower subsequent RNOA. Thus, in contrast to the arguments in 
FWY, permanently aggressive accounting and growth combine to explain the regularity, but 
permanently conservative accounting and growth do not. 

Panel D of Table 1 considers a combination of positive investment growth and tem- 
porarily conservative accounting. Neutral accounting is applied for event years —4 through 
—1, conservative accounting is applied in year 0, and neutral accounting is again applied 
in years 1 through 3. The application of conservative accounting in year 0 has implications 
for both year 0 and year 1. Conservative accounting in year 0 deflates year 0 RNOA to 
—14 percent. The conservative accounting also causes accruals to temporarily understate 
real investment, causing ACC to fall to —4 percent. Year 1 sees the reversal of the conser- 
vative accounting from period 0, causing income to be overstated. Conservative accounting 
also causes net operating assets at the beginning of period 1 (ie., the end of period 0) 
to be understated. These two effects combine to inflate period 1 RNOA to 37.5 percent. 
ACC is also overstated in year 1, because net operating assets are understated at the end 
of period 0. ; i 

The patterns observed in Panel D of Table 1 are broadly consistent with the empirical 
regularity described in Section II. Temporarily conservative accounting causes low accruals 
and low contemporaneous RNOA, followed by increased accruals and RNOA in the sub- 
sequent period. Note that the growth rate in investment is not critical to achieving this 
result. Regardless of the growth rate in investment, temporarily conservative accounting 
will depress current period RNOA and ACC and inflate subsequent period RNOA and ACC. 
A representative real-world example of such an accounting distortion would be the write- 
down of inventory or PP&E. Write-downs depress current period profitability and accruals, 
but inflate future period profitability and accruals. 

Finally, Panel E of Table 1 considers the combination of temporarily aggressive ac- 
counting with positive growth in investment. This panel represents the flip side of the 
temporarily conservative accounting in Panel D. Temporarily aggressive accounting causes 
high accruals and contemporaneous RNOA, followed by lower accruals and RNOA in the 
subsequent period. These patterns are also consistent with the empirical regularity described 
in Section I. Note again that the growth rate in real investment is not required to achieve 
: this result. Even in the absence of growth, temporarily aggressive accounting will inflate 
current period RNOA and ACC and deflate subsequent period RNOA and ACC. 

There are three key takeaways from Table 1. First, in contrast to the arguments in FWY, 
the numerical examples indicate that the interaction of growth in real investment and per- 
manently conservative accounting is inconsistent with the empirical regularity we are trying 
to explain. In fact, permanently conservative accounting causes patterns in accruals and 
RNOA that are exactly opposite to those observed empirically. Second, consistent with 
Dechow and Dichev (2002) and RSST, the examples indicate that the empirical results are 
consistent with temporary accounting distortions. Note that temporary accounting distor- 
tions achieve this result even in the absence of growth in real investment. Finally, the 
examples also indicate that the empirical results are consistent with a combination of growth 
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in real investment and permanently aggressive accounting.” Note, however, that our ex- 
amples ignore the possibility of alternative explanations, such as diminishing marginal re- 
turns to investment. The examples also do not help us to discriminate between the com- 
peting explanations based on temporary accounting distortions versus growth combined 
with permanently aggressive accounting distortions. To address these issues, we turn next 
to our empirical tests. 


IV. DATA 

Our empirical tests employ data from two sources. Financial statement data are obtained 
from the Compustat Industrial annual database. Information relating to SEC enforcement 
actions is found using the Lexis-Nexis Academic Universe product. Our sample period 
covers all firm years with available data on Compustat for the period 1962—2001. We also 
exclude all firm-year observations with SIC codes in the range 6000—6999 (financial 
companies). We eliminate firm-year observations with insufficient data on Compustat to 
compute the primary financial statement variables used in our tests.!° These criteria yield 
a final sample size with non-missing financial statement and returns data of 106,423 firm- 
year observations. Finally, we identify firm-years that were subject to enforcement actions 
for alleged GAAP violations associated with earnings overstatements by reading SEC en- 
forcement actions reported on Lexis-Nexis over our sample period. SEC enforcement ac- 
tions are only available from 1979 onward, so we limit ourselves to this period for the SEC 
enforcement action tests, reducing this sample to 76,165 firm-year observations. Of this 
sample, 169 firm-years are subject to SEC enforcement actions for alleged earnings 
overstatements. 

We calculate our measure of accruals (ACC) as (МОА, — NOA, (У/МОА, , NOA is the 
difference between operating assets (OA) and operating liabilities (OL). OA is equal to the 
sum of current operating assets (COA) and noncurrent operating assets (NCOA) and OL is 
equal to the sum of current operating liabilities (COL) and noncurrent liabilities (NCOL). 
These accrual components are calculated as follows: 


COA = Current Assets (Compustat Item #4) — Cash and Short-Term Investments 
(STI) (Compustat Item #1). 


NCOA = Total Assets (Compustat Item #6) — Current Assets (Compustat Item #4) 
` — Investments and Advances (Compustat Item #32). 


COL = Current Liabilities (Compustat Item #5) — Debt in Current Liabilities 
(Compustat Item #34). 


? In unreported numerical examples we also investigate the interaction of accounting distortions with negative 
growth in real investment. There are two takeaways. First, temporarily conservative and aggressive accounting 
continue to explain the empirical regularity in Sloan (1996) and FWY. Second, permanently aggressive account- 
ing interacts with negative growth to explain the empirical regularity. In other words, permanently aggressive 
accounting interacts with negative growth to temporarily depress accruals and profitability. 

10 Specifically, we require availability of Compustat data items 1, 4, 5, 6, 12, and 181 in both the current and 
previous year and data item 178 in the current year in order to keep a firm-year observation in the sample. If 
data items 9, 32 or 34 are missing, then we set them equal to zero rather than eliminating the observation. These 
data items represent balance sheet items that may not be relevant for some companies (e.g., investments and 
advances), so we set them to zero rather than needlessly discarding observations. All results are qualitatively 
similar if we instead estimate each regression using only observations with all data available for that particular 
regression. 
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NCOL = Total Liabilities (Compustat Item #181) — Current Liabilities (Compustat 
Item #5) — Long-Term Debt (Compustat Item #9). 


Recall that the decomposition in Equation (4) calls for deflation by lagged NOA.!! We 
exclude firm-year observations with negative values of this deflator variable from our anal- 
ysis. Following previous research, all financial statement ratios are winsorized at +/—1 
respectively to mitigate the influence of outliers. RNOA is calculated as operating income 
after depreciation (Compustat Item #178) deflated by lagged net operating assets.!? For our 
analysis of the persistence in the various components of earnings, we use this same defi- 
nition of earnings. For our growth versus efficiency decomposition, we compute sales 
growth (SG) as the change in sales (Compustat item #12) scaled by lagged sales and we 
compute change in asset turnover (AAT) as shown in Equation (4). 


V. RESULTS 

We present our results in three subsections. We begin by presenting descriptive statistics 
for accruals and related financial variables. These descriptive statistics allow us to examine 
the correspondence between the empirical properties of accruals and profitability and the 
properties generated in our examples. We next present tests examining the relation of the 
growth and efficiency components of accruals with respect to future operating profitability. 
Finally, we examine the relation between accruals and SEC enforcement actions for alleged 
overstatements of earnings. 


Descriptive Statistics 


Panel A of Table 2 provides descriptive statistics for accruals (ACC) and related finan- 
cial variables. The mean and median values for ACC are both positive, indicating that the 
typical firm's operating assets have grown over the sample period. The standard deviations 
of SG and AAT are both around 0.3, indicating that they are both important scurces of 
variation in ACC. Finally, both the mean and median values of RNOA are around 15 
регсепі. З 

Panel В of Table 2 reports pair-wise correlations for the financial variables. SG and 
AAT are strongly positively correlated, indicating that growing firms tend to experience 
increases in asset turnover, which is consistent with economies of scale. The positive cor- 
relation between these two components is indicative of a potentially important role for the 
interaction component of the sales and efficiency decomposition that we derived in Equation 


1! Fairfield et al. (2003b) argue that the lower persistence of accruals relative to cash flows is primarily ettributable 
to growth in the investment base that is not matched by growth in income. In other words, firms with high 
operating accruals are firms that are growing their operating investments, but earn a lower average return on 
their increased operating investments, leading to lower persistence in earnings performance. To investigate the 
importance of this issue for our results, we replicated all of the results in Table 3 using the Fairield et al. 
(2003b) approach of measuring our dependent variable with the same deflator as our independent variables. Our 
results are robust with respect to this procedure. For example, all accrual components continue to have a 
significantly negative coefficient. 

12 Note that we use a measure of operating income in our analysis of earnings persistence as it is unaffected by 
nonrecurring components of net income that are reported further down on the income statement. This increases 
the overall power of our persistence regressions. However, inferences regarding the relative persistence of the 
different components of earnings are qualitatively similar if we instead use bottom line net income (Compustat 
item 172). 

13 This is higher than the RNOA reported in Nissim and Penman (2001) of 10 percent, but they compute after-tax 
RNOA in their analysis. 
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TABLE 2 
Descriptive Statistics for Total Operating Accruals and Related Financial Variables 
(the sample covers 106,423 firm-year observations for the period 1962-2001) 


Panel A: Univariate Statistics 





Mean Std. Dev. 25% Median 75% 

АСС. 0.164 0.342 —0.023 0.092 0.280 
SG 0.159 0.295 0.009 0.111 0.252 

. AAT —0.036 0.315 —0.134 0.006 0.121 
RNOA 0.149 0.338 0.056 0.154 0.279 


Panel B: Correlation Matrix—Pearson (above diagonal) and Spearman (below diagonal) 
(p-values shown below correlations) 


ACC, SG, AAT, RNOA, RNOA,,, || SECAction, 
ACC, 04932 —0.6206 0.2034 0.0315 0.0250 
= (0.0001) (0.0001) (0.0001) (0.0001) (0.0001) 
SG, 0.4763 0.2717 0.1643 0.0780 0.0227 
(0.0001) fx (0.0001) (0.0001) (0.0001) (0.0001) 
-. AAT, —0.5825 0.2820 —0.0420 0.0745 —0.0118 
(0.0001) (0.0001) E (0.0001) (0.0001) (0.0001) 
` RNOA, 0.3450 0.3406  —0.0878 0.7541 0.0040 
(0.0001) (0.0001) (0.0001) = (0.0001) (0.1966) 
ЕМОА, а 0.1015 0.2039 0.0685 0.7357 —0.0175 
(0.0001) (0.0001) (0.0001) (0.0001) = (0.0001) 
SECAction, 0.0224 0.0188 —0.0121 ` 0.0040 —0.0181 
(0.0001) | (0.0001) (0.0001) (0.1970) (0.0001) == 
АСС = total accruals defined as the change in Net Operating Assets (МОА). It is calculated as 


operating 
[(МОА, — NOA,_,)/NOA,_.]; 
NOA = Operating Assets (OA) — Operating Liabilities (OL); 
OA = the sum of Current Operating Assets (COA) and Noncurrent Operating Assets (NCOA); 
OL = the sum of Current Liabilities (COL) and Noncurrent Operating Liabilities (УСОЇ); 
COA = Current Assets (Compustat Item #4) — Cash and Short-Term Investments (571) (Compustat Item 
#1); 
NCOA = Total Assets (Compustat Item #6) — Current Assets (Compustat Item #4) 
— Investments and Advances (Compustat Item #32); 
COL = Current Liabilities (Compustat Item #5) — Debt in Current Liabilities (Compustat Item #34); 
NCOL = Total Liabilities (Compustat Item #181) — Current Liabilities (Compustat Item #5) 
— Long-Term Debt (Compustat Item #9); 
SG = sales growth calculated as [(Sales,/Sales, |) — 1], where sales is Compustat Item #12; 


oe Sales,\ _ сон Sales, 

NOA, NOA,.., NOA; 

RNOA = return on net operating assets for the current year. It is calculated as income from continuing 
operations (Compustat Item #176) deflated by beginning net operating assets; and 


SECAction, = an indicator variable equal to 1 if the firm-year observation is subject to a SEC Enforcement 
Action in year t, and 0 otherwise. 








(4). We also see that ACC, SG, and AAT are all systematically related to SEC enforcement 
actions. We will explore these relations more fully in our subsequent regression analyses. 

Table 3 and Figure 1 report temporal trends in financial performance for firms with 
extreme accruals. We first rank firms annually on the magnitude of ACC and allocate them 
to decile portfolios in ascending order of ACC. We then track the mean (median) ACC, SG, 
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TABLE 3 
Time-Series Pattern in Total Operating Accruals (ACC), Sales Growth (SG), Changes in Asset 
Turnover (AAT), and Accounting Rates of Return (RNOA) for Extreme Accrual Deciles 
(the sample consists of 106,423 firm-year observations from 1962—2001) 


Year 


Panel A: Low ACC Decile 


ACC 0175 0173 0.155 0.084  -0379 0.100 0.130 0135 0.133 
(0.086) (0.083) (0.064) (0.010) (—0.330) (0.024) (0.063) (0.068) (0.067) 
56 0.166 0.159 0.143 0.082 —0.044 0.016 0087 0.093 0.096 
(0.102) (0.095) (0.076) (0.036) (-0.051) (0.011) (0.061) (0.067) (0.069) 
AAT  —0.050 -0054 -0058 -0054 0276 -0102 -0060 -0059  —0.059 
(0.001) (0.000) (—0.002) (0.010) (0286) (-0.016) (0.005) (0004) (—0.003) 
RNOA 0.095 0070 0.028 -0034 -0082 | -0.019 0037 0.064 0081 
(0.111) (0.097) (0.068) (0.027) (-0.008) (0054) (0094) (0.110) (0.117) 


Panel B: High ACC Decile 


ACC 0.186 . 0.197 0.228 0.287 0.863 0.259 0.182 0.152 0.138 
(0.109) (0.116) (0.147) (0.234) (1.000) (0.188) (0.118) (0.095) (0.084) 

SG 0204 0.223 0.263 0.332 0.444 0.306 0.168 0.145 0.134 
(0.145) (0.159) (0.197) (0.264) (0.397) (0.245) (0.130) (0.113) (0.106) 

AAT -0.023 -0018  —0.012 —0.007 -0483 -0009 -0035 -0028 -0.023 
(0.024) (0.030) (0.037) (0.029) (—0.470) (0.035) (0.009) (0.015) (0.020) 


RNOA 0.135 0.146 0.179 0.226 0.235 0.106 0.104 0.109 0.112 
(0.155) (0.169) (0.204) (0263) (0289) (0.141) (0.129) (0.127) (0.128) 


Mean (median) estimates are reported in the first row (second row in parentheses) for each variable. 
We sort firm-year observations into ten equal sized portfolios each year. We then track the mean (median) values 
of RNOA, SG, AAT, and ACC for the extreme deciles for four years either side of the ranking year (year 0). 
ACC = total operating accruals defined as the change in Net Operating Assets (NOA). It is calculated as 
[WOA, — МОА, ))/ МОА, ,]; 
NOA = Operating Assets (OA) — Operating Liabilities (OL); 
OA = the sum of Current Operating Assets (СОА) and Noncurrent Operating Assets (NCOA); 
OL = the sum of Current Operating Liabilities (COL) and Noncurrent Operating Liabilities (NCOL); 
COA = Current Assets (Compustat Item #4) — Cash and Short-Term Investments (STT) (Compustat Item #1); 
NCOA = Total Assets (Compustat Item #6) — Current Assets (Compustat Item #4) 
— Investments and Advances (Compustat Item #32); 
COL = Current Liabilities (Compustat Item #5) — Debt in Current Liabilities (Compustat Item dx 
NCOL = Total Liabilities (Compustat Item #181) — Current Liabilities (Compustat Item #5) 
— Long-Term Debt (Compustat Item #9); 
SG = sales growth calculated as [(Sales,/Sales, 1) — 1], where sales is Compustat Item #12; 
Sales, Sales, Sales, 
MP (тог) (пол) | МОА, "04 


RNOA = retum on net operating assets for the current year. It is calculated as income from continuing 
operations (Compustat Item #178) deflated by beginning net operating assets. 


AAT, and RNOA for the extreme low-ACC and high-ACC portfolios for the nine-year period 
centered on the ranking year. Several regularities are immediately obvious. First, accruals 
are rapidly mean reverting. Second, consistent with FWY and RSST, accounting rates of 
return track the patterns in accruals. In particular, we see that both the mean and median 
RNOA for the low (high) accrual decile are lowest (highest) in year 0, but then increase 
(decrease) sharply in year 1. Third, we see that both SG and AAT closely track the patterns 
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in accruals. Note, however, that the rate of mean reversion is much greater for AAT than it 
is for SG. In fact, as is clear from Figure 1, Panel D, AAT shows signs of weak negative 
serial correlation. This pattern is consistent with the predictions of our temporarily conser- 
vative accounting distortion examples from Panels D and E of Table 1, whereby NOA are 
temporarily misstated in the year of the distortion. 

Overall, these patterns in RNOA, SG, and AAT are consistent with the temporarily 
conservative and aggressive accounting distortion examples discussed in Section Ш. The 
low accrual decile corresponds to temporarily conservative accounting (see Panel D of Table 
1) and the high accrual decile corresponds to temporarily aggressive accounting (see Panel 
E of Table 1). These patterns are also broadly consistent with the interaction of permanently 
aggressive accounting with growth in real investment. The high accrual decile corresponds 
to aggressive accounting combined with positive growth (Panel C of Table 1) and the low 
accrual decile corresponding to aggressive accounting combined with negative growth. In 
short, these results are consistent with either temporarily aggressive and conservative ac- 
counting or the interaction of permanently aggressive accounting with growth. Contradicting 
the arguments in FWY, these results are inconsistent with the interaction of permanently 
conservative accounting with growth in real investment (see Panel B of Table 1). 


Persistence Results 


Table 4 presents cur analysis of the persistence of the cash flow and accrual components 
of earnings. We use the approach adopted in FWY and RSST and previously described in 
our development of Equation (1). We document the lower persistence of the accrual com- 
ponent of earnings by regressing future RNOA on current RNOA and ACC and testing for 
a negative coefficient on ACC. Using this specification, the coefficient on ACC represents 
the difference between the coefficient on the accrual component of RNOA and the coefficient 
on the cash component of RNOA (see FWY; RSST). Specifically, in Panel A of Table 4 we 
estimate the following regression: 


RNOA,,, = Yo + *,RNOA, + YACC, + v, 


and hypothesize that y, < 0. 

We report results based on the time-series means and t-statistics of annual cross- 
sectional regressions (Fama and MacBeth 1973) augmented with the Newey and West 
(1987) correction for autocorrelation. This approach typically generates a conservative es- 
timate of statistical significance (Loughran and Ritter 2000). Results are qualitatively iden- 
tical (albeit more statistically significant) using pooled OLS regressions. Consistent with 
the results in FWY and RSST, we find a statistically negative coefficient on ACC. This 
corroborates the well-known result that the accrual component of earnings is less persistent 
than the cash flow component of earnings (e.g., Sloan 1996, FWY; RSST). 

The regression in Panel B of Table 4 presents the regression results for our growth and 
efficiency decomposition of ACC: 


RNOA,,, = Y + У, ЕМОА, + 750, ЕЕ ҮзДАТ, =: ya(SG, * ААТ) F Vr (5) 
This decomposition allows us to investigate the diminishing marginal returns to investment 
explanation developed by FWY. Recall from Section П that diminishing marginal returns 
to investment will manifest itself in the SG component of accruals. The regressions indicate 
that the coefficients on —AAT and SG are both significantly negative, with a somewhat 
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TABLE 4 
Time-Series Means and t-Statistics for Coefficients from Annual Cross-Sectional Regressions 
of Next Year’s Accounting Rate of Return on This Year’s Accounting Rate of Return and This 
Year’s Total Operating Accruals 
(the sample consists of 106,423 firm-year observations from 1962—2001) 


Panel A: OLS Regressions for Accruals 
RNOA,,, = Yo + у, ВМОА, + у АСС, + vai 


Yo Y - Y2 Adj. R? 
Coefficient 0.029 0.722 0.5506 
(t-statistic) (5.42) (71.31) 
0.041 0.764 —0.131 0.5722 
(8.77) (67.33) (—14.85) 


Panel B: OLS Regressions for Growth and Efficiency Decomposition of Accruals 


RNOA,,, = Yo + YRNOA, + 7,56, — y,AAT, — y,(SG, * AAT) + Vai 


Yo Yı За Үз Vs Adj. R? 
Coefficient 0.034 0.730 —0.041 0.5527 
(t-statistic) (7.25) (69.82) (77.24) 

0.310 0.730 —0.117 0.5667 

(6.35) (72.29) (—15.99) 

0.027 0.728 —0.062 0.5537 

(5.32) (69.85) (-4Л3) ` 

0.042 0.756 —0.102 —0.151 —0.007 0.576 


(10.24) (61.83) (-13.65) (-17.06) (-0.93) 


Reported coefficient estimates and adjusted R?s are the means from 40 annual cross-sectional regressions. 
t-statistics are based on the time-series of the annual regression coefficient estimates using the Fama and 
MacBeth approach augmented by the Newey and West (1987) correction for autocorrelation. 
ACC = total operating accruals defined as the change in Net Operating Assets (NOA). It is calculated as 
[WOA, — NOA,_,)/NOA,_,]; 
NQA = Operating Assets (OA) — Operating Liabilities (OL); 
OA = the sum of Current Operating Assets (COA) and Noncurrent Operating Assets (NCOA); 
OL = the sum of Current Operating Liabilities (COL) and Noncurrent Operating Liabilities (NCOL); 
СОА = Current Assets (Compustat Item #4) — Cash and Short-Term Investments (577) (Compustat Item #1); 
NCOA = Total Assets (Compustat Item #6) — Current Assets (Compustat Item #4) 
— Investments and Advances (Compustat Item #32); 
COL = Current Liabilities (Compustat Item #5) — Debt in Current Liabilities ‘Compustat Item #34); 
NCOL = Total Liabilities (Compustat Item #181) — Current Liabilities (Compustat Item #5) 
— Long-Term Debt (Compustat Item #9); 
SG = sales growth calculated as [(Sales,/Sales, |) — 1], where sales is Compustat Item #12; 


шин (=) - (22 noe мод; #4 


RNOA = return on net operating assets for the current year. It is calculated as income from continuing 
operations (Compustat Item #178) deflated by beginning net operating assets. 








larger and more statistically significant coefficient on —AAT.!* The negative coefficient on 
—AAT is consistent with accounting distortions providing a significant contribution to the 


14 Unreported tests indicate that the coefficient on - ААТ is significantly more negative than the coefficient on SG 
at the 0.01 level. 
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lower persistence of accruals. Note also that since sales are generally based on accrual 
accounting estimates (e.g., accounts receivable), the negative coefficient on sales growth 
may be indicative of either diminishing returns to new investment or accounting distortions 
in sales. In summary, the evidence from the growth versus efficiency regressions is sug- 
gestive of a significant role for accounting distortions in explaining the lower persistence 
of the accrual component of earnings, though it does not rule out a supplementary role for 
diminishing returns to new investment. 


SEC Enforcement Action Results 

Tables 5 and 6 present our final set of tests, investigating the relation between accruals 
and SEC enforcement actions. The purpose of these tests is to examine the role of mana- 
gerial discretion in explaining the lower persistence of the accrual component of earnings. 
Finding a positive contemporaneous relation between accruals and SEC enforcement actions 
would be consistent with the presence of aggressive distortions in accruals that are attrib- 
utable to managerial discretion. Note that these tests provide evidence of a special case of 
the transitory accounting estimation error explanation provided in Dechow and Dichev 
(2002) and RSST. Specifically, following conjectures by Xie (2001) and Chan et al. (2005), 
these tests investigate whether the transitory distortions in accruals are attributable to in- 
tentional managerial manipulation. 

Our dependent variable, SECAction, takes the value of 1 for firm-years that the SEC 
subsequently alleges earnings were overstated in violation of GAAP, and 0 otherwise. Due 
to the dichotomous nature of the dependent variable, we use logistic regression analysis. 
The regression in Panel A of Table 5 confirms the existence of a significant positive relation 
between total operating accruals and the likelihood of an SEC enforcement for overstated 
earnings (see also, Bradshaw et al. 2001; Dechow et al. 1996). 

Panel B of Table 5 employs the RSST decomposition of total operating accruals into 
working capital accruals and noncurrent operating accruals. Recall that both FWY and 
RSST show that each of these two accrual components lead to lower earnings persistence. 
FWY, however, argue that the lower persistence of the noncurrent-operating component of 
accruals is unlikely to be attributable to managerial discretion. In contrast to the arguments 
in FWY, we find that both components of accruals have a strong positive relation with SEC 
enforcement actions, suggesting that noncurrent operating accruals also reflect managerial 
manipulation. These results suggest that both components of accruals are characterized by 
temporarily aggressive accounting distortions that are attributable to managerial discretion. 

Panel C of Table 5 report results from regressions of SEC enforcement actions on the 
growth and efficiency components of accruals. Both components are significantly related 
to SEC enforcement actions. The significance of sales growth is puzzling. A potential 
explanation is that sales growth reflects temporary accounting distortions arising from rev- 
enue manipulation. Consistent with this explanation, prior research has documented that 
revenue manipulation is a leading technique of earnings manipulation (see Dechow et al. 
1995; Dechow and Schrand 2004). Assuming that revenue manipulation explains the sig- 
nificance of sales growth, the negative coefficient on SG in Table 4 may also be driven by 
accounting distortions as opposed to diminishing marginal returns to investment. There are, 


15 The sales growth and efficiency measures potentially contain measurement error for firms engaging in merger 


or acquisition activity during the sample year. As a robustness check, we eliminate all observations with an 
increase in goodwill from prior year and re-estimated the regressions (reducing the sample size to 96,116 
observations). The results are qualitatively similar. 
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TABLE 5 
Logistic Regressions of SEC Enforcement Actions on Total Operating Accruals 
(the sample consists of 76,165 firm-year observations fram 1979 to 2001) 


Panel A: Logistic Regressions for Total Operating Accruals 
SECAction, = ү, + y,RNOA, + у; АСС, + v, 





Model 
Regression Coefficient Yo У: ~; Chi-Square p-value 
Estimated value —6.49 021 - 128 51.77 (0.0001) 


(p-value) (0.0001) (0.2574) (0.0001) 
Panel B: Logistic Regressions for Working Capital and Noncurrent Operating Components of 
Accruals 


SECAction, = ү, + у ЕМОА, + y,AWC, + y,ANCO, + v, 





: Model 
Regression Coefficient Yo У, Уа Үз Chi-Square | p-value 
Estimated value —6.29 0.26 1.24 32.59 (0.0001) 
(p-value) (0.0001) (0.1964) (0.0001) 
—6.36 0.35 1.05 30.73 (0.0001) 
(0.0001) (0.0708) (0.0001) 
—6.39 0.21 0.94 0.84 50.43 (0.0001) 


(0.0001) (0.2636) (0.0001) (0.0002) 


Panel С: Logistic Regressions for Growth and Efficiency Components of Accruals 
| SECAction, = ^, + ,RNOA, + 7,56, — 4AAT, — y,(SG, * AAT, + v, 


Model 
Regression Coefficient Yo У Уа Үз Y4 Chi-Square p-value 
Estimated valie —6.49 0.33 1.36 47.06 (0.0001) 
(p-value) (0.0001) о ree (0.0001) : 
—6.23 0.65 14.81 (0.0006) 
(0.0001) © 0339) (0.0019) 
—6.18 0.34 1.29 24.70 10.0001) 
(0.0001) (0.1039) (0.0001) 
—6.52 0.22 140 0.60 0.27 73.22 20.0001) 


(0.0001) (0.2237) (0.0001) (0.0333) (0.4129) 


We identified firms subject to a SEC enforcement action via a reading of the SEC Accounting and Auditing 
Enforcement Releases from 1979—2001. 
p-values are presented below the coefficients. 
SECAction, = an indicator variable equal to 1 if the firm-year observation is subject to a SEC Enforcement 
Action in year f, and 0 otherwise; 
RNOA = return on net operating assets for the current year. It is calculated as income from continuing 
operations (Compustat Item #178) deflated by beginning net operzting assets; 
ACC = total operating accruals defined as the change in Net Operating Assets (NOA). It is calculated as 
. [WOA, — NOA,_,)/NOA,_,}; 
МОА = Operating Assets (ОА) — Operating Liabilities (OL); 
ОА = the sum of Current Operating Assets (СОА) and Noncurrent Operating Assets (NCOA); 
OL = the sum of Current Operating Liabilities (COL) and Noncurrent Operating Liabilities (ҮСЭГ, 


(continued on next page) 
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TABLE 5 (Continued) 


COA = puru Assets (Compustat Item #4) — Cash and Short-Term Investments (571) (Compustat Item 


#1); 
NCOA = Total Assets (Compustat Item #6) — Current Assets (Compustat Item #4) 
~ Investments and Advances (Compustat Item #32); 
COL = Current Liabilities (Compustat Item #5) — Debt in Current Liabilities (Compustat Item #34); 
NCOL = Total Liabilities (Compustat Item #181) — Current Liabilities (Compustat Item #5) 
— Long-Term Debt (Compustat Item #9); 
Change in working capital (AWC) = [(WC, — WC, /NOA, ,, where WC = (COA — COL); 
Change in noncurrent operating accruals (ANCO) = [(NCO, — NCO,_,)/NOA,.,, where NCO = (NCOA 
— МСОШ, 
С = sales growth calculated as [(Sales,/Sales,.,) — 1], where sales is Compustat Item #12; and - 


$ 
Sales| _ (Sales,,\| /Sales, 
MAS (568) zal NOA; 








TABLE 6 
Regressions of Total Operating Accrual Ranks on Current and Lagged SEC 
Enforcement Actions 
(the sample consists of 76,165 firm-year observations from 1979 to 2001) 


Rank of АСС, = Yo + YıSECAction, + y,SECAction,_, + y,SECAction,_, 
+ ySECActlon, з + v, 


Yo У Ya Үз у Адј. Е: 
Model 1: 0.478 0.155 0.0005 
(394.67) (5.89) 
Model 2: 0479 0.192 —0.078 0.0006 
Coefficient (394.62) | (636) | (-2.51) 
(t-statistic) Model 3: 0.467 0.169 0.023 —0.165 ` 0.0009 
(364.59) (5.18) (0.62) (—4Л1) 
Model 4: 0.460 0.165 0.027 —0.119 —0.081 0.0009 


(339.47) (459) (0.69) | (-2.90) -(-243) 


We assign firms into ten-equal-sized portfolios each year based on the magnitude of ACC. The rank is 
standardized to range from 0 to 1. We use this rank as our dependent variable in the regression analysis above. 
Results are similar if we instead use the raw accrual measure. 
SECAction, = an indicator variable equal to 1 if the firm-year observation is subject to a SEC Enforcement 
Action in year t, and 0 otherwise. We identified firms subject to a SEC enforcement actions via a 
reading of the SEC Accounting and Auditing Enforcement Releases from 1979—2001; 
ACC = total operating accruals defined as the change in Net Operating Assets (NOA). It is calculated as 
[(МОА, — NOA,_,)/NOA,_,]; 
NOA = Operating Assets (OA) — Operating Liabilities (OL); 
OA = the sum of Current Operating Assets (COA) and Noncurrent Operating Assets (NCOA); 
OL = the sum of Current Operating Liabilities (COL) and Noncurrent Operating Liabilities (NCOL); 
COA = Current Assets (Compustat Item #4) — Cash and Short-Term Investments (577) (Compustat Item 


#1); 
NCOA = Total Assets (Compustat Item #5) — Current Assets (Compustat Item #4) — Investments and 
Advances (Compustat Item #32); 
COL = Current Liabilities (Compustat Item #5) — Debt in Current Liabilities (Compustat Item #34); and 
NCOL = Total Liabilities (Compustat Item #181) — Current Liabilities (Compustat Item #5) — Long-Term 
Debt (Compustat Item #9). 


however, two alternative explanations for this result. First, the SEC may disproportionately 
target high-growth firms. Such firms are more likely to be raising new capital, thus making 
them more likely targets for SEC enforcement actions. Second, recall from the numerical 
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example in Section III that permanently aggressive accounting combined with positive 
growth can cause earnings to be temporarily overstated. Thus, the SEC may only target 
cases of permanently aggressive accounting in the presence of positive sales growth, be- 
cause this is the only situation where earnings are overstated. 

We can use the SEC enforcement action data to investigate one additional implication 
of the accounting distortion explanation. Recall from the numerical examples in Table 1 
that the negative relation between accruals and future profitability is consistent with both 
temporary accounting distortions and the interaction of permanently aggressive accounting 
with real investment growth. One difference between these two cases is that temporarily 
aggressive accounting distortions involve the understatement of accruals in subsequent pe- 
riods (see Panel E of Table 1). The interaction of permanently aggressive accounting with 
growth, in contrast, does not involve the understatement of accruals in future periods (see 
Panel C of Table 1). To discriminate between these two explanations, we therefore inves- 
tigate the relation between accruals and lagged SEC enforcement actions (in other words, 
we investigate the relation between SEC enforcement actions and subsequent accruals). If 
temporarily aggressive accounting distortions drive the high accruals at the time of SEC 
enforcement actions, then the subsequent reversal of these distortions should lead to a 
negative relation between accruals and lagged SEC enforcement actions. Note that all ac- 
crual reversals take place in the very next period in our examples. In practice, however, 
accrual reversals may take longer than one year. We therefore test for this possibility in 
Table 6 by examining up to three years of lagged SEC enforcement actions. 

Consistent with the results in Table 5, Table 6 documents a strong positive contem- 
poraneous relation between accruals and SEC enforcement actions (the coefficient on y, is 
always statistically positive). More importantly, the addition of lagged accruals to the re- 
gression reveals evidence of significantly negative coefficients, particularly at the second 
and third lags. This evidence is consistent with the existence of temporarily aggressive 
accounting distortions at the time of SEC enforcement actions that take two to three years 
to reverse. It suggests that, at least for the SEC enforcement action subsample, the properties 
of accruals are attributable to temporarily aggressive accounting rather than the interaction 
of permanently aggressive accounting with growth in real investment. 

The results from our SEC enforcement action tests are summarized in Figure 2. This 
figure plots ACC, RNOA, SG, and AAT for the SEC sample over the 11-year window 
centered on the alleged earnings overstatement. Panel A plots the mean and median values 
of ACC for the SEC sample. The solid line across the middle of the graph plots the average 
ACC for our entire sample of 0.164 as a visual benchmark. The graph clearly shows that 
accruals are unusually high in the year of the alleged earnings overstatement and the two 
preceding years. Accruals then post a sharp decline in the year immediately following the 
overstatement and remain unusually low for the next several years. Panel B plots the mean 
and median values of RNOA for the SEC sample. This graph shows that the sharp reduction 
in accruals between year 0 and year 1 causes a corresponding sharp reduction in RNOA. 
RNOA is unusually high in the years leading up to the SEC enforcement action and un- 
usually low thereafter. Panel C demonstrates that the SG component of accruals is tempo- 
rarily high in the year of the alleged earnings overstatement and temporarily low in the 
following years. This result is consistent with revenue-based earnings manipulations that 
temporarily inflate sales growth. Panel D demonstrates that the AAT component of accruals 
is temporarily low in the year of the alleged earnings overstatement and temporarily high 
in the following years. This result is consistent with temporarily aggressive accounting 
distortions at the time of SEC enforcement actions that reverse in subsequent years. 
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VL CONCLUSION 

This paper examines the role of accounting distortions in explaining the lower persist- 
ence of the accrual component of earnings. A number of recent papers have argued that 
the lower persistence of the accrual component of earnings is more likely to be attributable 
to diminishing marginal returns to new investment and other “‘growth-based” explanations 
rather than to accounting-based explanations. In contrast, our analysis and empirical evi- 
dence suggests that temporary accounting distortions play an important role. At the same 
time, we are unable to rule out a supplementary role for growth-based explanations, such 
as diminishing marginal returns to new investment. We conclude that temporary accounting 
distortions are a significant contributing factor to the lower persistence of the accrual com- 
ponent of earnings. Our analysis of SEC enforcement actions also indicates that some of 
these accounting distortions are attributable to intentional managerial manipulation of 
accruals. 

Our findings reinforce the importance of accounting analysis in the evaluation of fi- 
nancial performance. Accrual accounting seeks to provide more relevant information to 
investors. The increase in relevance, however, comes at the cost of reduced reliability. 
Reduced reliability manifests itself in the form of transitory estimation error in accruals 
and increased opportunities for the opportunistic use of managerial accounting discretion 
in setting accruals. Our evidence indicates that the lower persistence of the accrual com- 
ponent of earnings is a direct manifestation of this crucial trade-off between relevance and 
reliability. Our findings suggest that the current push by accounting standard setters toward 
“fair value” accounting is likely to increase the frequency and magnitude of accounting 
distortions, leading to a reduction in the persistence and reliability of earnings. 

Future research could further examine the impact of this trade-off on the properties of 
accounting numbers. A detailed examination of specific accruals categories (e.g., inventory, 
receivables) could provide detailed corroborating evidence on the role of temporary ac- 
counting distortions. Additional topics for future research include: (1) the determination of 
which accruals are most subject to accounting distortions; (2) the detailed documentation 
of the origination and reversal of such distortions; and (2) the identification of external 
factors facilitating accounting distortions, such as subjective accounting standards and weak 
corporate governance. 
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APPENDIX 
Proof of Growth and Efficiency Decomposition in Equation (4) 


Total Net Accruals = ANOA,/NOA,_, 





= ASales,//Sales, | > AAT,/AT, — (ASales,/Sales,_,)*(AAT,/AT,) 
——————— - 
Sales Change in (4) 
Growth Efficiency Interaction 


The RHS of this expression can be reduced to the LHS as follows: 
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Editorial Data 


The following table contains information about turnaround time for manuscripts (including revisions) 
on which editorial decisions were made in the 12-month period ended December 31, 2005. Turnaround 
time is the number of days between the date that the manuscript was received and the date of the 
editor’s letter to the author(s). 





Number of . Cumnlative Cumulative 
Manuscripts Percent Number Percent 
0 x Days x 30 84 18.06 84 18.06 
31 = Days = 60 183 39.36 267 57.42 
61 = Days = 90 140 30.11 407 87.53 
91 = Days = 120 49 10.54 456 98.07 
121 = Рауз _09 1.93 465 100.00 

Total 465 100.00 


The mean review time was 55.93; the median review time was 77 days. 


New Submissions by calendar year (January 1 to December 31) 


1992 241 
1993 234 
1994 231 
1995 195 
1996 230 
1997 195 
1998 196 
1999 239 
2000 260 
2001 260* 
2002 324 
2003 327 
2004 307 
2005 389 


ж 2001--іп addition to the 260 regular submissions, 68 MSs were submitted to the TAR Quality of 
Earnings Conference. 


The acceptance rate (defined as number of MSs accepted divided by the number of new submissions) 
over the past 5 years is between 12-16% per year. 
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. Titles of journals should not be abbreviated. 

. Multiple works by the same author(s) in the same year are distinguished by letters after the date. 

. Inclusive page numbers are treated as recommended in Chicago Manual section 17.168.B. 


Sample entries are as follows: 


American Accounting Association, Committee on Concepts and Standards for External Financial Reports. 
1977. Statement on Accounting Theory and Theory Acceptance. Sarasota, FL: AAA. 
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Demski, J. S., and D. E. M. Sappington. 1989. Hierarchical structure and responsibility accounting. Journal . 
of Accounting Research 27 (Spring): 40-58. 

Dye, R., B. Balachandran, and R. Magee. 1989. Contingent fees for audit firms. Working paper, Northwestern 
University, Evanston, IL. 

Fabozzi, Е, and I. Pollack, eds. 1987. The Handbook of Fixed Income Securities. 2nd edition. Homewood, 
IL: Dow Jones-Irwin. 

Kahneman, D., P. Slovic, and A. Tversky, eds. 1982. Judgment Under Uncertainty: Heuristics and Biases. 
Cambridge, U.K.: Cambridge University Press. 

Porcano, T. M. 1984a. Distributive justice and tax policy. The Accounting Review 59 (October): 619-636. 

. 1984b. The perceived effects of tax policy on corporate investment intentions. The Journal of the 
American Taxation Association 6 (Fall): 7—19. . 

Shaw, W. Н. 1985. Empirical evidence оп the market impact of the safe harbor leasing law. Ph.D. dissertation, 
The University of Texas at Austin. 

Sherman, T. M., ed. 1984. Conceptual Framework for Financial Accounting. Cambridge, MA: Harvard 
Business School. 





Footnotes: Footnotes are not used for documentation. Textual footnotes should be used only for extensions 
and useful excursions of information that if included in the body of the text might disrupt its continuity. 
Footnotes should be consecutively numbered throughout the manuscript with superscript Arabic numerals. 
Footnote text should be doubled-spaced and placed at the end of the article. 


SUBMISSION OF MANUSCRIPTS 
Authors should note the following guidelines for submitting manuscripts: 


1. Manuscripts currently under consideration by another journal or publisher should not be submitted. The 
author must state that the work is not submitted or published elsewhere. 

2. In the case of manuscripts reporting on field surveys or experiments, four copies of the instrument 
(questionnaire, case, interview plan, or the like) should be submitted. Information that might identify the 
author(s) must be deleted from the instrument. 

3. MS should be submitted via email as Microsoft® Word or PDF files to the ТАЁ office at TARUAZ@ 
email.arizona.edu. A submission letter, a separate cover page, and the MS containing no author identifi- 
cation should be emailed as separate files. The submission fee should be paid online (VISA or MasterCard 
only) at: https://aaahgq.org/AAAforms/journals/tarsubmit.cfm. If unable to pay online, a check in U.S. 
funds, made payable to the American Accounting Association, may be sent to Professor Dan Dhaliwal, 
Senior Editor, The Accounting Review, The University of Arizona, Department of Accounting, McClelland 
Hall 301, 1130 E. Helen Street, Tucson, AZ 85721-0108. The submission fee in U.S. funds is $125.00 
for members and $200.00 for nonmembers of the AAA. Effective January 1990, the submission fee is 
nonrefundable. 

. The author should retain a copy of the paper. 

. Revisions must be submitted within 12 months from request, otherwise they will be considered new 
submissions. 
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COMMENTS 


Comments on articles previously published in The Accounting Review will be reviewed (anonymously) by 
two reviewers in sequence. The first reviewer will be the author of the original article being subjected to 
critique. If substance permits, a suitably revised comment will be sent to a second reviewer to determine its 
publishability in The Accounting Review. If a comment is accepted for publication, the original author will 
be invited to reply. АП other editorial requirements, as enumerated above, also apply to proposed comments. 


POLICY ON REPRODUCTION 


An objective of The Accounting Review is to promote the wide dissemination of the results of systematic 
scholarly inquiries into the broad field of accounting. 

Permission is hereby granted to reproduce any of the contents of the Review for use in courses of 
instruction, as long as the source and American Accounting Association copyright are indicated in any such 
reproductions. 
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Written application must be made to the American Accounting Association, 5717 Bessie Drive, Sarasota, 
FL 34233-2399, for permission to reproduce any of the contents of the Review for use in other than courses 
of instruction—e.g., inclusion in books of readings or in any other publications intended for general distri- 
bution. In consideration for the grant of permission by the Review in such instances, the applicant must 
notify the author(s) in writing of the intended use to be made of each reproduction. Normally, the Review 
will not assess a charge for the waiver of copyright. 

Except where otherwise noted in articles, the copyright interest has been transferred to the American 
Accounting Association. Where the author(s) has (have) not transferred the copyright to the Association, 
applicants must seek permission to reproduce (for all purposes) directly from the author(s). 


POLICY ON DATA AVAILABILITY 


The following policy has been adopted by the Executive Committee in its April 1989 meeting. "An 
objective of (The Accounting Review, Accounting Horizons, Issues in Accounting Education) is to proyide 
the widest possible dissemination of knowledge based on systematic scholarly inquiries into accounting as 
a field of professional research, and educational activity. As part of this process, authors are encouraged to 
make their data available for use by others in extending or replicating results reported in their articles. 
Authors of articles which report data dependent results should footnote the status of data availability ше 
when pertinent, this should be accompanied by information on how the data may be obtained.” 
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MAY 2006 PLACEMENT ADS 


The deadline for information to be included in the “Placement Information” section 
is two months prior to the date of publication. The Review is published in January, 
April, July, and October. Placement advertising and replies should be sent to the 
American Accounting Association, 5717 Bessie Drive, Sarasota, FL 34233-2399, 
ATTENTION: ADVERTISING DEPT. All replies to box numbers should be on sep- 
arate sheets to facilitate forwarding. 


ROBERT MORRIS UNIVERSITY, Department of Accounting and Taxation, invites applications 
for anticipated faculty positions beginning August 2006. Candidates with teaching and research 
interests in all areas of accounting will be considered. Candidates should either possess a Ph.D. 
or be nearing completion of their degree program. Appointment will be at the Assistant, Associate, 
or Full Professor level, depending on the applicant’s experience and accomplishments. Located 
near Pittsburgh, Robert Morris University is a private institution, enrolling approximately 5,000 
students. Teaching is mostly at the undergraduate level, although opportunities to teach graduate 
courses are available. Creating a high-quality, innovative learning environment is the Department’s 
top priority. Scholarly activity is encouraged and supported. Salary and benefits are competitive. 
Candidates should submit a letter of application, curriculum vitae, and names and addresses for 
three references to: Professor James E. Rebele, School of Business, Robert Morris University, 
6001 University Boulevard, Moon Township, PA 15108. Applications may be submitted electron- 
ically to: rebele@nnu.edu. Robert Morris University is an Equal Opportunity Employer. 


UNIVERSITY OF NOTRE DAME, Department of Accountancy, invites applications for position 
openings at tbe Assistant and Associate Professor levels. AII research areas will be considered. 
Applicants for Assistant Professor positions should expect to complete doctoral degree require- 
ments before the start of the contract period. All candidates should have a strong commitment to 
scholarly research and excellence in teaching. Salary is very competitive and research support is 
excellent. Please send curriculum vitae and an example of scholarly work to: Professor Tom 
Schaefer, Chair, Department of Accountancy, Room 102D, Mendoza College of Business Ad- 
ministration, University of Notre Dame, Notre Dame, IN 46556-5646. The улан of Notre 
Dame is an Equal Opportunity / Affirmative Action Employer. 


OREGON STATE UNIVERSITY invites applicants for a tenure-track position in Accounting at 
the rank of Assistant, Associate, or Full to begin September 2006, subject to funding. Teaching 
duties will be in auditing, financial, and/or tax. Responsibilities include research, quality teaching, 
and professional activities in the field of accounting. Candidates should hold the appropriate 
doctorate. Professional certification is desirable. Salary is competitive, fringe benefits are excellent. 
Oregon State University is located in Corvallis, which is rated the second-best small city in 
America. Interested applicants should submit a letter of interest, current уйа, recent teaching 
evaluations, evidence of scholarly work, and a list of three references to: Dr. Roger Graham, 
College of ‘Business, 200 Bexell Hall, Oregon State University, Corvallis, Oregon 97331-2603. 
For full consideration apply by February 28, 2006. OSU is an Equal Opportunity Employer and 
seeks diversity. 
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NORTHEASTERN ILLINOIS UNIVERSITY, College of Business and Management, invizes ap- 
plications for Ph.D./D.B.A. with specialization in Accounting from an AACSB-accredited College 
of Business program in one or both of the following areas: Accounting Principles and Firancial 
Accounting. The position is flexible to accommodate other teaching and research interests. One 
tenure-track position at ће Assistant/ Associate Professor rank beginning 8/06, is anticipated for 
teaching undergraduate and graduate courses. Potential for excellent teaching and research. Se- 
lection process begins on January 15, 2006 and continues until position is filled. Send application 
letter, resume with email address, transcripts, and names of three references to: Dr. C. T. Chen, 
Chair, Department of Accounting, Business Law and Finance, Northeastern Illinois University, 
5500 М. St. Louis Avenue, Chicago, IL 60625. Fax: 773-442-6110. Email: C-Chen3 @пеш ейп. 
AA/EOE. 


UNIVERSITY OF TENNESSEE AT CHATTANOOGA, College o£ Business Administration, in- 
vites applications for an expected tenure-track Assistant or Associate Professor position, primarily 
teaching managerial accounting in the M.B.A. program. Candidates should have a Ph.D. or D.B.A. 
in Accounting or have an estimated completion date of August 2026. Candidates must be com- 
mitted to excellence in teaching. We are especially interested in faculty who want to combine 
teaching and research with a close working relationship with the Chattanooga community. Salary 
and benefits are AACSB competitive. Review of applicants will begin on February 1 and continue 
until the position is filled. For information, contact Dr. Rick Turpin, University of Tennessee at 
Chattanooga, College of Business Administration, Department of Accounting and Finance, Dept. 
6206, 615 McCallie Avenue, Chattanooga, TN 37403. Telephone: (423) 425-4664; Email: Rick- 
Turpin @utc.edu. Minority applicants are strongly encouraged to apply. The University of Ten- 
nessee at Chattanooga is an Equal Employment Opportunity, Affirmative Action, Title VI ard IX, 
Section 504, ADA Institution. 


McGILL UNIVERSITY, Desautels Faculty of Management, has tenure-track positions available 
at the Assistant, Associate, and Full Professor levels. Applications in all areas of accounting are 
invited. Applicants should send their curriculum vitae, three letters of recommendation, recent 
publications and working papers to: Ms. Karen Robertson, Faculty of Management, McGill Uni- 
versity, 1001 Sherbrooke, St. West Montreal, Quebec, CANADA H3A 105. In accordance with 
Canadian Immigration requirements, priority consideration will be given to Canadian citizens and 
permanent residents of Canada. McGill University is committed to equity in employment. 


Experienced PhD, CPA seeks a visiting position to teach financial, managerial, or international 
accounting courses for the 2006-2007 academic year. To this assignment, I bring a wealth of 
national and international teaching experience, an established publication record, and excellent 
teaching evaluations. Teaching assignments include faculty positions in the U.S. and the U.K. T 
participate regularly in AAA meetings and am a Regional Coordinator for the International Sec- 
tion. A curriculum vitae will be furnished upon request. You may contact me at profranQ 
hotmail.com. 


MISSISSIPPI STATE UNIVERSITY, College of Business and Industry, School of Accountancy, 
invites applications for an Assistant or Associate Professor of Accounting, a tenure-track position 
beginning Fall 2006. Responsibilities include teaching undergraduate and graduate courses in 
Taxation. Doctorate degree in accounting is required. The position requires excellence in teaching 
based on prior experience as a Graduate Teaching Assistant or as an Instructor or Assistant 
Professor, ability to do research leading to publication in nationally refereed journals, апі an 
interest and willingness to be involved in professional accounting activities. An interest in adv-sing 
and mentoring students is expected, including participation in the Ph.D. program. Professional 
certification and professional accounting experience is bighly desirable. Salary is competitive. 
Send letter of application, current curriculum vitae, and names and contact information of three 
references to: School of Accountancy, Attn: Search Committee Chair, PO Box EF, Mississippi 
State, MS 39762. MSU is an AA/EOE. 
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309 COLLEGE OF MOUNT ST. JOSEPH invites applications for a full-time tenure track Assistant 


Professor in Accounting in the Department of Business Administration beginning Fall 2006. 
Candidates with a doctorate in Accounting are preferred; ABDs will be considered but must 
.complete the degree prior to start date. Required competencies include demonstrated effectiveness 


“in college teaching in Accounting Principles, Intermediate Accounting, and other courses in the 


applicant’s area of interest; and demonstrated ability to successfully conduct scholarly research 


` _ and publish, consistent with AACSB standards. Preference will be given to individuals with the 


ability to enhanze teaching effectiveness with use of relevant technology. Applicants should send 
a cover letter, curriculum vita, evidence of teaching effectiveness, and arrange for three letters of 
recommendation to be sent to: Mr. Jim Brodzinski, Chair, Department of Business Administration, 
College of Maunt St. Joseph, 5701 Delhi Rd., Cincinnati, ОН 45233, or by email: Jim. 
Brodzinski @ mail. msj.edu. 


EAST CAROLINA UNIVERSITY, Department of Accounting, invites applications for a tenure- 
track Assistant/Associate Professor position in Accounting for the 2006-2007 academic year. 
Applicants should have or expect to have a Ph.D. in Accounting by August 2006, and appropriate 
professional cextification. Our current needs include Government Accounting and Internal Audit- 
ing. A 12-s.b. load is standard with release time granted for research and publication productivity. 
Send credentials (cover letter which describes specific skills and experience relevant to this po- 
sition, current resume / vita, evidence of research/ publication activity, current teaching evaluations, 
and three original letters of reference) to Dr. Dan Schisler, Chair, East Carolina University, 3208- 
A Bate Building, Greenville, NC 27858-4353. Applicants must comply with the Immigration 
Reform and Control Act. ECU is an Equal Opportunity/Affirmative Action University and ac- 
commodates individuals with disabilities. Individuals requesting a disability accommodation 
should call the ECU Office Disability Support Services at (252) 328-6799 (Voice/TTY /Relay). 
Official transcripts are required upon employment. 


TRINITY UNIVERSITY invites applications for the position of Jesse H. Jones Distinguished 
Professor of Accounting effective August 2007. The position requires a Ph.D./D.B.A. in Ac- 
counting, an established record of zesearch, a commitment of continued scholarship, and excel- 
lence in teaching. Candidates should have the qualifications necessary for appointment as a Pro- 
fessor and will be expected to have an interest in teaching Financial Accounting Theory and 
Regulation. The teaching load is two courses per semester. The salary is competitive. The AACSB- 
accredited Department of Business Administration offers bachelor's and master's degrees in Ac- 
counting. Applicants should send a letter of interest, curriculum vitae, and: a sample of written 
work to: Dr. Sankaran Venkateswar, Chair of Search Committee, Department of Business 
Administration, Trinity University. One Trinity Place, San Antonio, TX 78212-7200; Email: 
svenkate G'tririty.edu. Review of applications will begin immediately, and the position will remain 
open until filled. Trinity University is an Equal Opportunity Employer. 


UNIVERSITY OF NORTH CAROLINA, GREENSBORO, Bryan School of Business and Eco- 
nomics, Department of Accounting, invites applications for anticipated positions at the Assistant, 
Associate, or Professor levels, beginning August 1, 2006. Our primary needs are in financial, 
accounting information systems, and auditing, but all will be considered. A doctorate in account- 
ing ‘or an ABD nearing completicn is required. Interested candidates should submit a letter of 
application, vita, and the names of three references to: Professor Bill Harden, Chair, Search 
Committee, PO Box 26165, Greensboro, NC 27402-6165. Review of applications will begin 
September 1, 2005 and continue until the position is filled. EEO/AA. 
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UNIVERSITY OF NORTH CAROLINA, GREENSBORO, Bryan School of Business and Eco- 
nomics, Department of Accounting, invites applications for anticipated positions at the Assistant, 
Associate, or Professor levels, beginning August 1, 2006. Our primary needs are in financial, 
accounting information systems, and auditing, but all will be considered. A doctorate in account- 
ing or an ABD nearing completion is required. Interested candidates should submit a letter of 
application, vita, and the names of three references to: Professor Bill Harden, Chair, Search 
Committee, PO Box 26165, Greensboro, NC 27402-6165. Review of applications will begin 
September 1, 2005 and continue until the position is filled. EEO/AA. 


THE BRATTLE GROUP provides consulting services and expert testimony on economic, finance, 
regulatory, and strategic issues. We are seeking an Associate to join our Finance consulting 
practice area in the Cambridge, Massachusetts office. The ideal candidate is a recent graduate 
with an Accounting Ph.D. or an Accounting Professor interested in a full-time consulting career. 
Candidates may be a CPA or have similar experience coupled with a recent M.B.A. degree. 
Practical experience should be in the areas of financial reporting and disclosure. To be considered, 
please visit the Careers section of our website and submit your materials there: http:// 
www.brattle.com/Careers/Default.asp. Alternatively, materials may be faxed to our Washington, 
D.C. recruiting office at: (202) 419-3335. The Brattle Group is an Equal Opportunity Employer. 


ILLINOIS WESLEYAN UNIVERSITY, Department of Business Administration, invites appli- 
cations for two tenure-track positions in Accounting at the rank of Assistant Professor. The normal 
teaching load is three courses and two preparations per semester. Illinois Wesleyan seeks candi- 
dates with doctorates in accounting. ABDs nearing completion aad others with extraordinary 
qualifications are also encouraged to apply. Starting date is August 2006. Salaries will be com- 
petitive. Illinois Wesleyan University has distinguished itself as an independent, liberal arts uni- 


versity. To apply, send a letter of application, curriculum vitae, three reference letters, and a 
transcript. In the application letter the successful candidate should be able to demonstrate a com- 
mitment to teaching business within a liberal arts setting. Review of applications will begin 
immediately. Send applications to: Robert A. Kearney, Chair, Department of Business Adminis- 
tration, Illinois Wesleyan University, PO Box 2900. Bloomington, IL, 61702-2900. 


LONG ISLAND UNIVERSITY, BROOKLYN CAMPUS invites applications for a full-time 
tenure-track faculty position in Accounting at the rank of Assistant Professor beginning Fall 2006 
in the Department of Accounting, Taxation and Law. We seek an individual whose scholarly 
attainments are consistent with tenure-track expectations and who is committed to high-quality 
teaching, research, and professional service. Minimum Quslifications: Ph.D. in Accounting is 
required. Or, a Master's Degree in Accounting and a Ph.D. in a related field are required. Can- 
didates with current CPA, CMA, or CIA are preferred. (ABDs will also be.considered with an 
obligation to promptly complete all degree requirements). Teaching experience at the undergrad- 
uate and/or graduate levels is desired. The Application Process: Please attach letter of interest 
and c.v to an email with subject heading “Tenure-track Faculty Position in Accounting” and send 
it to: Roberta.Scotto@liu.edu; or, send a hard copy to: Search Committee/ Accounting Faculty 
Position, c/o Ms. Roberta Scotto, Department of Accounting, Taxstion and Law, Long Island 
. University, One University Plaza-H700, Brooklyn, NY 11201. 
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FLORIDA INTERNATIONAL UNIVERSITY, School of Accounting, is seeking faculty members 
for visiting professor positions for Fall 2006 and/or Spring 2007 to teach undergraduate- and 
graduate-level tax courses. Appointments at the Assistant, Associate, and Full professor ranks 
will be considered. Positions will be open until filled. Ph.D., D.B.A. or equivalent terminal degree 
required. A high degree of competence is expected in teaching at both the undergraduate and 
graduate levels. Professional certification is preferred. Salary is competitive and negotiable based 
upon qualifications and teaching load. The School is separately accredited by the AACSB. Un- 
dergraduate tax courses are taught at the University Park campus and Biscayne Bay campus. 
Graduate tax courses are taught on Saturdays at Biscayne Bay campus, and on Monday and 
Tuesday mornings at Downtown Miami campus. Contact Cherie J. Hennig, Faculty Director, 
MST Program, School of Accounting, FIU, Miami, FL 33199; Phone: (305) 348-3266, Fax: (305) 
348-2914, Email: hennigc @fiu.edu. 


UNIVERSITY OF MINNESOTA, DULUTH, Labovitz School of Business and Economics, has 
a tenure-track position in health care management, beginning Fall 2006. For a complete position 
description and information on how to apply, please visit our website at http: // www.d.umn.edu/ 
Sbe/faculty/openpositions.php. The University of Minnesota is an Equal Opportunity Educator 
and Employer. 


UNIVERSITY OF ALABAMA, HUNTSVILLE invites applications for a tenure-track Assistant/ 
Associate Professor of Accounting starting August 2006. Teaching specialization open but pref- 
erence given to Managerial, Cost, Governmental, or Tax Accounting. Candidates must hold Ph.D. 
in Accounting from an AACSB-accredited institution. Associate Professor requires established 
record of research and quality teaching at both graduate and undergraduate levels. Previous pro- 
fessional experience is desirable. The business school is AACSB-accredited and has Compustat 
and Securities return dataset. Nestled in the foothills of the Appalachian Mountains, the population 
of the area is 350,000. The area has award-winning schools, excellent transportation, and numer- 
ous cultural and recreational opportunities. Send letter of interest, vita, and names and addresses 
of three references to: Dr. Dorla Evans, Chair of Accounting Search Committee, College of 
Administrative Science, UAH, Huntsville, Al 35899. Refer position number 06/07-182 on all 
correspondence. Review of applications begins immediately. The EE will remain open until 
filled. UAH is an AA/EEO institution. 


TEXAS TECH UNIVERSITY, Area of Accounting of the Rawls College of Business, invites 
applications for a one-year visiting position at the rank of Assistant or Associate Professor to 
begin in the fall semester of 2006. Our primary area of need is Financial Accounting: Principles 
of Financial Accounting and Intermediate Accounting. Preference will be given to candidates with 
an earned dociorate in accounting, but ABDs will be considered. Applicants should send a cover 
letter, curriculum vita, and a list of three references to: Dr. Ralph Viator, Search Committee Chair, 
Texas Tech University, Rawls College of Business, Lubbock, Texas 79409-2101. Alternately, 
materials may be sent electronically to: Ralph.ViatorGttu.edu. 


WESTERN ILLINOIS UNIVERSITY, Department of Accountancy, invites applications for Chair 
of the Department. The chair provides leadership in the areas of teaching, research, and service; 
represents the interests of the Department to the College; and facilitates fundraising and mainte- 
nance of AACSB accreditation. The position is at the rank of tenured Full Professor, and excellent 
teaching, research, and service credentials are required. Salary is AACSB competitive. Appoint- 
ment may commence July 1, 2005, January 1, 2007, or July 1, 2007. A letter of application, 
current vita, znd the names and contact information of at least three references should be sent to: 
Accountancy Chair Search Committee, Dean’s Office, College of Business and Technology, West- 
ern Illinois University, 1 University Circle, Macomb, IL 61455-1390. A full position description 
is available at http: // www.win.edu/employment/emp.sphp?id— 335. 
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322 MONTANA STATE UNIVERSITY, College of Business, seeks applicants to fill one tenure-track 
position in Accounting for Fall 2006. Montana State University has an enrollment of approxi- 
mately 12,000 students and is located in Bozeman, an attractive, culturally alive small city, 90 
miles north of Yellowstone National Park. Fully accredited by the AACSB, the College of Busi- 
ness offers a Bachelor of Science in Business with options in Accounting, Finance, Management, 
and Marketing, and a Master of Professional Accountancy. Colleagues in all areas of accounting 
will be considered, however, our greatest teaching needs are in the areas of accounting theory/ 
research and systems. In addition, research activities and service consistent with the College’s 
mission are expected. The position requires a Ph.D. or D.B.A. in Accounting or a closely related 
field from an AACSB-accredited university. Preference will be given to those who have completed 
their doctoral degrees or will complete them prior to beginning work at Montana State University. 
For more information and complete job descriptions with application requirements and deadlines, 
please visit: http:// www.montana.edu/msuinfo/jobs/faculty / or contact: Anne L. Christensen at 
аппес @ montana.edu or (406) 994-2043. ADA/AA/EO/ Veteran's Preference. 


UNIVERSITY OF COLORADO AT DENVER, Business School and Health Sciences Center, 
seeks applicants for two tenure-track faculty positions in accounting. Minimum qualifications for 
Assistant Professor include a Ph.D. in accounting and a commitment to excellence in research 
and teaching. Candidates for advanced rank (Associate or Full Professor) must demonstrate sig- 
nificant accomplishments in research, teaching and service. Specialization in managerial account- 
ing, auditing and fraud examination, or accounting information systems is preferred, though all 
areas will be considered. Applicants who demonstrate proficiency using new accounting or in- 
formation technologies in their teaching are especially encouraged to apply. Review of applications 
begins immediately, and will continue until the positions are filled. Both appointments begin Fall 
Term 2006, or as negotiated. Applicants for Visiting Positions are also encouraged to apply. Both 
positions are dependent on funding availability. The Carnegie Foundation has just classified 
UCDHSC as RU/VH: Research Universities (very high research activity). For full description, 
please see: http://www.ucdhsc.edu/admin/hr/facpos.htméasstprof. The Business School web- 
site is: http: // www.cudenver.edu/ Academics /Colleges/Business/default.htm. Contact: Professor 
Bruce R. Neumann, Director of Accounting Programs, PO Box 173364, Campus Box 165, Denver, 
CO 80217-3364; or Bruce.Neumann @ cudenver.edu. 


UNIVERSITY OF LOUISIANA, MONROE invites applications for a tenure-track Assistant/ 
Associate Professor of Accounting with a primary teaching emphasis in auditing, starting August 
2006. The College of Business at ULM promotes a balance between teaching and intellectual 
contributions, combined with professional service to the university, discipline, and community. 
The Accounting Program and the College of Business Administration are both accredited by 
AACSB International. Candidates must have an earned doctorate (or ABD nearing completion) 
with appropriate professional certification and/or experience. Applicants should provide evidence 
of excellence in teaching and scholarship, curriculum vitae, transcripts, and names of three ref- 
erences. Applications will be considered until the position is filled; direct applications to Dr. 
Michelle McEacharn, The University of Louisiana at Monroe, 700 University Avenue, Monroe, 
LA 71209-0110. The University is an Equal Opportunity/ Affirmative Action Employer located 
in a metropolitan area of 150,000 persons. For more information, visit the ULM website: 
http:// www.ulm.edu. 
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ALABAMA STATE UNIVERSITY, College of Business Administration, Department of Account- 
ing and Finance, invites applications for an Assistant/Associate Professor of Accounting (JC- 
2022A). This is a nine-month, tenure-track position for the 2006 Fall Semester. The successful 
applicant will teach at the undergraduate and graduate level, engage in research and publication, 
participate in university-wide service and professional activities, mentor and academic advising 
of students. Qualifications: Ph.D. in Accounting from an AACSB International accredited school. 
Demonstrate research and publication in refereed journals. Strong leadership, administrative skills, 
and ability to work effectively in a shared governance environment is preferred. Previous faculty 
experience is required. Prior business experience and ability to interact with the business com- 
munity are desirable. Salary: Competitive, Starting date: 2006 Fall Semester, immediately follow- 
ing selection and approval of a candidate. To apply: Interested applicants should submit an em- 
ployment application form listing the names, address and phone numbers/email address of the 
last three employers on the reference sheet, transcripts, and resume to: Office of Personnel and 
Human Relations, Alabama State University, PO Box 271, Montgomery, AL 36101. 


UNIVERSITY OF TEXAS-PAN AMERICAN invites applications for Department Chair of Ac- 
counting and Business Law to begin Fall 2006. Salary is AACSB competitive and commensurate 
with achievements and experience. Successful candidate should be eligible for tenure at rank of 
Associate or Full Professor. Candidates should have doctorate degree in accounting, strong lead- 
ership skills, ability to work and communicate effectively in a participative, collegial, cultur- 
ally diverse environment, and willingness to develop and maintain fruitful relationships with 
professionals ia the community. For more information please visit http://www.utpa.edu/ 
humanresources/employment/coba.html. Interested candidates should send a letter of interest and 
comprehensive vita to: Dr. Rubik Atamian, Search Committee Chair, Department of Accounting 
and Business Law, COBA, UTPA, 1201 W. University Drive, Edinburg, TX 78541-2999. Review 
of applications begins immediately, and position will remain open until filled. UTPA is an Equal 
Opportunity / Affirmative Action Employer. Women and minorities are encouraged to apply. 
DEVRY UNIVERSITY seeks an Accounting/Finance Curriculum Manager and Full-time Faculty 
for Accounting and Finance courses. Individual would be responsible for both development and 
management of curriculum and faculty, plus teaching. The development component includes re- 
sponsibility for course curriculum guides and web-related course development where “Best Prac- 
tices" are utilized. Curriculum managers work with staff, faculty, and publishers to help assure 
that the curriculum is current with existing business practices and one of high quality. Management 
of curriculum includes approval of new faculty plus working with these individuals as they prepare 
for their first teaching experience for KGSM/DeVry University. Specific requirements: Earned 

' doctorate plus academic and professional accounting and/or finance expertise; understanding of 
curriculum development processes; excellent teaching practices; computer literacy; excellent com- 
munication skills. Send resume and contact information for three references to: Dori Lombard at 
diombard @ keller.edu. 
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ST. MARY'S UNIVERSITY, Bill Greehey School of Business invites applications to fill a tenure- 
track position in accounting at the assistant/associate/full professor level. Discipline expertise in 
auditing and information systems strongly desired. Expertise in other area(s) of accounting is a 
plus. Candidates are required to hold a PhD or DBA from an AACSB-accredited institution, and 
demonstrate the ability to bring excellence and innovation into the classroom, maintain a strong 
and continuing record of scholarship consistent with the teacher-scholar model, and possess | 
attributes leading to successful professional service. Salary is competitive. The position begins as - 
` early as Fall 2006. interested candidates should send an application letter, current vita, and 
names with contact information for three references to: Dr. Tom Madison, Chair, Department of — 
:. Accounting, Bill Greehey School of Business, St. Mary's University, One Camino Santa Maria, San : 
Antonio, TX 78228-8607. Deadline is June 1, 2006. 


